Accepted for/Published in: JMIR mHealth and uHealth
Date Submitted: Sep 26, 2021
Date Accepted: May 13, 2022
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Predicting Next-Day Perceived and Physiological Stress of Pregnant Women Using Machine Learning and Explainability: Algorithm Development and Validation
ABSTRACT
Background:
Cognitive behavioral therapy (CBT)-based interventions are effective in reducing prenatal stress, which can have severe adverse health effects on mother and newborn if unaddressed. Predicting next-day physiologic or perceived stress can help to inform and enable preemptive interventions for a likely physiologically and/or perceptibly stressful day. Machine learning models are useful tools that can be developed to predict next-day physiologic and perceived stress using data collected the previous day. Such models can improve our understanding of the specific factors that predict physiologic and perceived stress and will also allow researchers to develop systems that collect selected features for assessment for clinical trials in order to minimize the burden of data collection.
Objective:
To build and evaluate a machine-learned model that predicts next-day physiologic and perceived stress using sensor-based, ecological momentary assessment (EMA)-based, and intervention-based features and to explain the prediction results.
Methods:
We enrolled pregnant women into a prospective proof-of-concept study and collected electrocardiography, EMA, and CBT intervention data over 12 weeks. We used the data to train and evaluate six machine learning models to predict next-day physiologic and perceived stress. After selecting the best performing model, SHapley Additive exPlanations (SHAP) were used to identify feature importance and explainability of each feature.
Results:
A total of 16 pregnant women enrolled in the study. Overall, 4157.18 hours of data were collected, and participants answered 2838 EMAs. After applying feature selection, 8 and 10 features were found to positively predict next-day physiologic and perceived stress, respectively. A random forest classifier performed the best in predicting next-day physiologic (F1-score 0.84) and next-day perceived stress (F1-score 0.74) using all features. While any subset of sensor-based, EMA-based, and/or intervention-based features could reliably predict next-day physiologic stress, EMA-based features were necessary to predict next-day perceived stress. Analysis of explainability metrics showed that prolonged duration of physiologic stress was highly predictive of next-day physiologic stress and that physiologic stress and perceived stress were temporally divergent.
Conclusions:
In this study we were able to build interpretable machine learning models to predict next-day physiologic and perceived stress, and we identify unique features that were highly predictive of next-day stress that can help reduce the burden of data collection.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.