Accepted for/Published in: JMIR Public Health and Surveillance
Date Submitted: Apr 27, 2024
Date Accepted: Dec 24, 2024
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
A trustworthy, data-driven, decision-making framework for global infectious disease mitigation
ABSTRACT
Infectious diseases (ID) have a significant detrimental impact on global health. Timely and accurate infectious disease forecasting can result in more informed implementation of control measures and prevention policies. However, for operational decision-making in real-world circumstances, a standardized, reliable, and trustworthy platform that can forecast multiple diseases across different geographic locations is largely lacking. To meet this need, we built an ID forecasting pipeline and visualization dashboard generalized across a wide range of modeling techniques, IDs, and global locations. We included a wide range of statistical, machine learning, and deep learning models and trained them on a multitude of features (e.g., demography, landscape, climate, and socioeconomic factors) within the One Health landscape. The dashboard was built to report crucial operational metrics - prediction accuracy, computational efficiency, spatio-temporal generalizability, uncertainty quantification, and interpretability - which are essential to strategic data-driven decisions. While no single best model was suitable for all disease, region, and country combinations, our ensemble technique selects the best model for any given scenario, achieving peak forecasting performance. For new or emerging diseases in a region, the ensemble model can predict how the disease may behave in the new region using a pre-trained model from a similar region with a history of that disease. The data visualization dashboard provides an interactive, clean interface of important analytical metrics, such as ID temporal patterns, forecasts, prediction uncertainties, and model feature importance across geographic locations and disease combinations. As the need for real-time, operational ID forecasting capabilities increases, this standardized and automated platform for overall data collection, analysis, and reporting is a major step forward in enabling evidence-based public health decisions and policies for prevention and mitigation of future ID outbreaks.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.