Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Apr 25, 2022
Open Peer Review Period: Apr 22, 2022 - May 16, 2022
Date Accepted: Aug 1, 2022
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
A Machine Learning Approach for Continuous Mining of Non-identifiable Smartphone Data to Create a Novel Digital Biomarker Detecting Generalized Anxiety Disorder: A Feasibility Study
ABSTRACT
Background:
Anxiety is one of the leading causes of mental health disability around the world. Currently, a majority of the population who experience anxiety go undiagnosed or untreated. New and innovative ways of diagnosing and monitoring anxiety have emerged using smartphone sensor-based monitoring as a metric for the management of anxiety. This is a novel study as it adds to the field of research through the use of non-identifiable smartphone usage to help detect and monitor anxiety remotely and in a continuous and passive manner.
Objective:
The aim of this study is to demonstrate the feasibility of a daily mental health behavioural profiling metric called Mental Health Similarity Score for anxiety generated by mining non-identifiable smartphone data.
Methods:
Smartphone data and self-reported 7-item GAD anxiety assessments were collected from 229 participants on the Android operating system in an observational study over an average of 14 days (SD=29.8). 34 features were mined to be constructed as a potential digital phenotyping marker from continuous smartphone usage data. We further analyzed the correlation of these digital behavioral markers against each item of the GAD-7 and its influence on the machine learning algorithms predictions.
Results:
A total of 229 participants were recruited in this study that had completed the GAD-7 assessment and had at least one set of passive digital data collected within a 24-hour period. The mean GAD-7 score was 11.8 (SD=5.7). Regression modeling was tested against classification modeling and the highest prediction accuracy was achieved from a binary Random Forest classification model (Precision 89-92%; Recall 85-94%; F188-91%; accuracy 90%; AUC 96%). Non-parametric permutation testing with Pearson correlation results indicated the proposed metric (MHSS) had the strongest relationship between GAD-7 items 1,3, and 7.
Conclusions:
The proposed Mental Health Similarity Score (MHSS) metric demonstrates the feasibility of using passive non- intrusive smartphone data and machine learning-based data mining techniques to track an individual’s daily anxiety levels with a 90% accuracy that directly related to the Generalized Anxiety Disorder-7 scale.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.