Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Oct 22, 2018
Open Peer Review Period: Oct 25, 2018 - Nov 29, 2018
Date Accepted: Mar 30, 2019
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Early Detection of Depression: Social Network Analysis and Random Forest Techniques

Cacheda F, Fernandez D, Novoa FJ, Carneiro V

Early Detection of Depression: Social Network Analysis and Random Forest Techniques

J Med Internet Res 2019;21(6):e12554

DOI: 10.2196/12554

PMID: 31199323

PMCID: 6598420

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Early Detection of Depression: Social Network Analysis and Random Forest Techniques

  • Fidel Cacheda; 
  • Diego Fernandez; 
  • Francisco J Novoa; 
  • Victor Carneiro

Background:

Major depressive disorder (MDD) or depression is among the most prevalent psychiatric disorders, affecting more than 300 million people globally. Early detection is critical for rapid intervention, which can potentially reduce the escalation of the disorder.

Objective:

This study used data from social media networks to explore various methods of early detection of MDDs based on machine learning. We performed a thorough analysis of the dataset to characterize the subjects’ behavior based on different aspects of their writings: textual spreading, time gap, and time span.

Methods:

We proposed 2 different approaches based on machine learning singleton and dual. The former uses 1 random forest (RF) classifier with 2 threshold functions, whereas the latter uses 2 independent RF classifiers, one to detect depressed subjects and another to identify nondepressed individuals. In both cases, features are defined from textual, semantic, and writing similarities.

Results:

The evaluation follows a time-aware approach that rewards early detections and penalizes late detections. The results show how a dual model performs significantly better than the singleton model and is able to improve current state-of-the-art detection models by more than 10%.

Conclusions:

Given the results, we consider that this study can help in the development of new solutions to deal with the early detection of depression on social networks.


 Citation

Please cite as:

Cacheda F, Fernandez D, Novoa FJ, Carneiro V

Early Detection of Depression: Social Network Analysis and Random Forest Techniques

J Med Internet Res 2019;21(6):e12554

DOI: 10.2196/12554

PMID: 31199323

PMCID: 6598420

Per the author's request the PDF is not available.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.