Accepted for/Published in: JMIR Formative Research
Date Submitted: Sep 24, 2025
Open Peer Review Period: Sep 24, 2025 - Sep 24, 2025
Date Accepted: Mar 31, 2026
(closed for review but you can still tweet)
Prediction of Clinically Significant Depressive Symptoms at 2-Year Follow-Up in Older Adults: Machine Learning Study Using the English Longitudinal Study of Ageing (ELSA)
ABSTRACT
Background:
Depression in older adults is often underdiagnosed due to atypical symptom presentation and generational stigma, leading to delayed intervention. Early identification is critical to improve outcomes, but traditional approaches have limited predictive accuracy.
Objective:
This study aimed to develop and evaluate machine learning models to predict the two-year onset of depression in older adults using data from the English Longitudinal Study of Ageing (ELSA), and to identify key risk factors that contribute to depression onset.
Methods:
Data were drawn from four consecutive waves of ELSA, including participants aged 50 years and above who were free of baseline depression and prior psychiatric diagnoses. Over 120 sociodemographic, psychological, and health-related features were analyzed. We applied a range of traditional classifiers (Random Forest, XGBoost, Support Vector Machines) and deep tabular models (TabTransformer, TabNet). Model performance was assessed using area under the receiver operating characteristic curve (AUROC) and F1-score. Model interpretability was examined using SHapley Additive exPlanations (SHAP) to identify the most influential features.
Results:
Across waves, the best-performing models achieved mean AUROC scores of 0.72–0.73, with a peak of 0.75 in the highest-performing wave. SHAP consistently identified age, loneliness, sleep disturbances, and low social engagement as strong predictors of depression onset. Traditional machine learning models (Random Forest, XGBoost, Support Vector Machines) generally outperformed deep learning models for this task.
Conclusions:
Our findings demonstrate the feasibility of using machine learning to predict future depression onset in older adults with moderate accuracy. The identification of consistent risk factors highlights opportunities for developing targeted clinical screening tools and preventive interventions. This study provides novel evidence on depression prediction in the UK context, leveraging longitudinal data from ELSA, and contributes to advancing digital mental health research for aging populations.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.