Accepted for/Published in: Interactive Journal of Medical Research
Date Submitted: Sep 30, 2023
Date Accepted: Sep 24, 2024
Predicting depressive symptoms using GPS-based regional data: A study with the CORONA HEALTH app during the COVID-19 pandemic in Germany
ABSTRACT
Background:
Numerous studies have been conducted to predict depressive symptoms using passive smartphone data, mostly integrating the GPS signal as a measure of mobility. Environmental factors have been identified as correlated with depressive symptoms in specialized studies both before and during the pandemic.
Objective:
The present study combined a data-based approach using passive smartphone data to predict self-reported depressive symptoms with a wide range of GPS-based environmental factors as predictors.
Methods:
The CORONA HEALTH app was developed for the purpose of data collection, and this app enabled the collection of both survey and passive data via smartphone. After obtaining informed consent, we gathered GPS signals at the time of study participation and evaluated depressive symptoms in 249 Android users with the Patient Health Questionnaire (PHQ-9). Data collection took place from July 2020 to February 2021, coinciding with the COVID-19 pandemic. Using GPS data, each dataset was linked to a wide variety of data on regional sociodemographic, geographic and economic characteristics describing the respondent's environment, which were derived from a publicly accessible database from official German statistical offices. Moreover, pandemic-specific predictors such as the current pandemic phase or the number of new regional infections were matched via GPS. For the prediction of individual depressive symptoms, we compared three models (i.e., ridge, lasso and elastic net regression) and evaluated the models using tenfold cross-validation.
Results:
The final elastic net regression model showed the highest explained variance (R-squared = .06) and reduced the dataset from 121 to 9 variables, the three main predictors being current COVID-19 infections in the respective district, the number of places in nursing homes, and the proportion of fathers receiving parental benefits. The number of places in nursing homes refers to the availability of care facilities for the elderly, which may indicate regional population characteristics that influence mental health. The proportion of fathers receiving parental benefits reflects family structure and work-life balance, which could impact stress and mental well-being during the pandemic.
Conclusions:
Passive data describing the environment contributed to the prediction of individual depressive symptoms and revealed regional risk and protective factors that may be of interest without their inclusion in routine assessments being costly.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.