Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Apr 5, 2023
Date Accepted: Jul 3, 2024
Evaluation of Machine Learning to Detect Influenza Using Wearable Sensor Data and Patient-Reported Symptoms: A Cohort Study
ABSTRACT
Background:
Machine learning offers quantitative pattern recognition analysis of wearable device data and has the potential to detect illness onset and monitor influenza-like illness (ILI) in infected patients.
Objective:
To evaluate the ability of machine learning algorithms to distinguish between influenza-positive and influenza-negative participants in a cohort of symptomatic ILI patients using wearable sensor (activity) data and self-reported symptom data during the latent and early symptomatic period of ILI.
Methods:
This cohort study used the extreme gradient boosting (XGBoost) classifier to determine whether a participant was influenza positive or negative based on three models: symptoms data only; activity data only; and combined symptoms and activity data. Data were collected from the Home Testing of Respiratory Illness (HTRI) study and FluStudy2020, both conducted between December 2019 and October 2020. Analyses included participants in these studies with an at-home influenza diagnostic test result. Fitbit devices were used to measure participants’ steps, heart rate, and sleep data. Participants detailed their ILI symptoms, healthcare-seeking behaviors, and quality of life. Model performance was assessed by: area under the curve (AUC), balanced accuracy, recall (sensitivity), precision (positive predictive value [PPV]), and F2 (weighted harmonic mean of precision and recall) score.
Results:
An influenza diagnostic test result was available for 953 and 925 participants in HTRI and FluStudy2020 respectively, of whom 848 and 840 had activity data. The highest-performing model used symptoms and activity data (test set means: 0.74 AUC, 0.68 balanced accuracy, 0.70 sensitivity, 0.38 PPV, 0.60 F2). The symptoms-only model had the second-best performance (0.74 AUC, 0.69 balanced accuracy, 0.65 sensitivity, 0.42 PPV, 0.58 F2). The top features guiding influenza detection were cough, mean resting heart rate during main sleep, and fever for the combined model, and cough, fever, and chills for the symptoms-only model.
Conclusions:
Machine learning algorithms had insufficient accuracy to detect influenza, suggesting that previous findings from research-grade sensors tested in highly controlled experimental settings may not easily translate with scalable commercial-grade sensors. In the future, more advanced wearable sensors may improve their performance in the early detection and discrimination of viral respiratory infections. Clinical Trial: NCT04245800
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.