Currently accepted at: JMIR Formative Research
Date Submitted: Oct 20, 2025
Open Peer Review Period: Oct 20, 2025 - Dec 15, 2025
Date Accepted: Mar 17, 2026
(closed for review but you can still tweet)
This paper has been accepted and is currently in production.
It will appear shortly on 10.2196/86203
The final accepted version (not copyedited yet) is in this tab.
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Maternal health Aggregated Trends can be Misleading: The power of N-of-1 Level Wearable Data Analysis for Personalized Pregnancy Monitoring
ABSTRACT
Background:
Personal digital health technologies (DHTs) enable real-time monitoring of physiological metrics and behavioral data, including HRV, supporting early detection of pregnancy-related conditions and personalized care throughout the perinatal period. While recent studies demonstrate the utility of personal DHTs in tracking pregnancy-related symptoms, they often rely on aggregate statistical methods that overlook individual variability.
Objective:
To compare aggregate and individual-level analyses of digital health technology (DHT) data for early detection of pregnancy-related conditions, using the comprehensive BUMP dataset to highlight the importance of individual variability and data heterogeneity.
Methods:
This BUMP study (Jan 2021 – May 2022) analyzed physiological and behavioral metrics, such as heart rate variability (HRV), sleep, and fatigue, in 256 individuals using Oura rings and self-reported surveys. Individual-level (N-of-1) trajectories were evaluated and compared with aggregate results to uncover personal and collective trends. A statistical method was developed to assess the influence of adverse events and severe symptoms, while case studies explored confounding and modifying factors underlying heterogeneity. Comprehensive statistical analysis included the coefficient of determination, Kolmogorov-Smirnov tests, likelihood ratio tests, and Welch’s t-tests, with inter-individual variability flagged based on high-variability thresholds.
Results:
Results revealed significant variability in HRV, sleep, and fatigue throughout pregnancy. For instance, only 4.76% of individuals had HRV inflection points at the aggregate week 33 inflection, with a 14.24% coefficient of variation. Our analysis found no significant p-values for demographic or pregnancy complication-based subgrouping, suggesting these factors alone do not drive the observed variability. Case studies further highlighted both intra- and inter-individual differences, emphasizing the importance of considering external factors like adverse events and severe symptoms.
Conclusions:
Our findings show that aggregate wearable data often fails to generalize across populations, oversimplifying pregnancy-related physiological and subjective changes. This simplification can obscure individual trajectories, leading to generalized insights that may not reflect many pregnant women's experiences. Our results highlight the impact of heterogeneity on pregnancy outcomes, emphasizing the need to move beyond one-size-fits-all models and leverage DHT for personalized care.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.