Currently submitted to: JMIR Infodemiology
Date Submitted: Nov 5, 2025
Open Peer Review Period: Nov 24, 2025 - Jan 19, 2026
(closed for review but you can still tweet)
NOTE: This is an unreviewed Preprint
Warning: This is a unreviewed preprint (What is a preprint?). Readers are warned that the document has not been peer-reviewed by expert/patient reviewers or an academic editor, may contain misleading claims, and is likely to undergo changes before final publication, if accepted, or may have been rejected/withdrawn (a note "no longer under consideration" will appear above).
Peer review me: Readers with interest and expertise are encouraged to sign up as peer-reviewer, if the paper is within an open peer-review period (in this case, a "Peer Review Me" button to sign up as reviewer is displayed above). All preprints currently open for review are listed here. Outside of the formal open peer-review period we encourage you to tweet about the preprint.
Citation: Please cite this preprint only for review purposes or for grant applications and CVs (if you are the author).
Final version: If our system detects a final peer-reviewed "version of record" (VoR) published in any journal, a link to that VoR will appear below. Readers are then encourage to cite the VoR instead of this preprint.
Settings: If you are the author, you can login and change the preprint display settings, but the preprint URL/DOI is supposed to be stable and citable, so it should not be removed once posted.
Submit: To post your own preprint, simply submit to any JMIR journal, and choose the appropriate settings to expose your submitted version as preprint.
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Early Depression Detection in Social Media: Monitoring of Individual Nighttime Dynamics and LLM Analysis
ABSTRACT
Background:
Depression has become a major global public health challenge, and early intervention is critical for improving patient outcomes. Current depression detection techniques based on social media data (Traditional Risk Detection, TRD) rely heavily on users’ complete historical information, which cannot meet the timeliness requirements of early intervention. This underscores the need for Early Risk Detection (ERD) methods emphasizing early-stage and real-time warning. However, existing ERD studies have notable limitations: (1) they overlook sleep–wake rhythms hidden in posting timestamps, missing vital warning signals; and (2) they depend on static templates or resource-intensive sequence models, resulting in limited interpretability and inefficient use of early data, ultimately constraining their clinical applicability for early intervention.
Objective:
To address these issues, this study aims to develop an efficient, reliable, and interpretable ERD model. The core objectives are: to extract sleep–wake rhythm features from posting timestamps, thereby to enrich the feature dimensions for risk warning; to leverage large language models (LLM) for improved text filtering precision and depression-related factor analysis; and ultimately to achieve accurate early detection of depression, supporting early clinical intervention.
Methods:
We propose the Monitoring of Individual Nighttime Dynamics and LLM Analysis (MIND) model, which integrates two key innovations: (1)Sleep Dynamics: posting timestamps are transformed into sleep–wake rhythms, analyzing fluctuations in posting frequency and time to derive sleep-related features, thereby compensating for the limitations of text-only approaches. (2)LLM Depression Profiler: LLM is used for dynamic text filtering, automatically removing irrelevant noise and focusing on potential depression-related cues. Based on LLM semantic understanding, latent depression risk factors are identified, enhancing interpretability for clinical treatment and robustness to noise.
Results:
Experiments on the eRisk2017 benchmark dataset demonstrated that MIND significantly outperformed existing baseline models in early detection sensitivity, specificity, and accuracy. By combining sleep features with text analysis, the model achieved interpretable, traceable predictions that can support clinical treatment. Relevant experimental code is publicly available
Conclusions:
The MIND model innovatively combines sleep–wake rhythm features with LLM-based text analysis, addressing the challenges of poor interpretability and inefficient use of early-stage data in existing ERD methods. It significantly enhances early detection performance, offering a new paradigm for applying social media data in ERD task, thereby enabling earlier intervention and reducing the public health burden of depression.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.