JMIR Preprints #69142: Towards a clinically actionable, electronic health record-based machine learning model to forecast 90-day change in hemoglobin A1c in youth with type 1 diabetes: Feasibility and model development study

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Towards a clinically actionable, electronic health record-based machine learning model to forecast 90-day change in hemoglobin A1c in youth with type 1 diabetes: Feasibility and model development study

Erin M Tallon;
David D Williams;
Cintya Schweisberger;
Colin Mullaney;
Brent Lockee;
Diana Ferro;
Craig A Vandervelden;
Mitchell S Barnes;
Angelica Cristello Sarteau;
Anna R Kahkoska;
Susana R Patton;
Sanjeev Mehta;
Ryan McDonough;
Marcus Lind;
Leonard D'Avolio;
Mark A Clements

ABSTRACT

Background:

Clinicians currently lack an effective means for identifying youth with type 1 diabetes (T1D) who are at risk for experiencing glycemic deterioration between diabetes clinic visits. As a result, their ability to identify youth who may optimally benefit from targeted interventions designed to address rising glycemic levels is limited. Although electronic health records (EHR)-based risk predictions have been used to forecast some health outcomes in T1D, no study has investigated the potential for using EHR data to identify youth with T1D who will experience a clinically significant rise in HbA1c ≥0.3% (~3 mmol/mol) between diabetes clinic visits.

Objective:

We evaluated the feasibility of using routinely collected EHR data to develop a machine learning model to predict 90-day unit-change in HbA1c (in % units) in youth (ages 10-17) with T1D. We assessed our model's ability to augment clinical decision-making by identifying a percent change cut-point that optimized identification of youth who would experience a clinically significant rise in HbA1c.

Methods:

From a cohort of 2,757 youth with T1D who received care from a network of pediatric diabetes clinics in the Midwestern United States (January 2012-August 2017), we identified 1,743 youth with 9,643 HbA1c observation windows (i.e., 2 HbA1c measurements separated by 70-110 days, approximating the 90-day time interval between routine diabetes clinic visits). We used up to 5 years of youths' longitudinal EHR data to transform 17,466 features (demographics, laboratory results, vital signs, anthropometric measures, medications, diagnosis codes, procedure codes, and free text data) for model training. We performed three-fold cross-validation to train random forest regression models to predict 90-day unit-change in HbA1c(%).

Results:

Across all 3 folds of our cross-validation model, average root mean squared error was 0.88 (95% CI, 0.85-0.90). Predicted HbA1c(%) strongly correlated with true HbA1c(%) (r=0.79; 95% CI, 0.78-0.80). The top 10 features impacting model predictions included postal code, various metrics related to HbA1c, and the frequency of a diagnosis code indicating difficulty with treatment engagement. At a clinically significant percent rise threshold of ≥0.3% (~3 mmol/mol), our model's positive predictive value (PPV) was 60.3%, indicating a 1.5-fold enrichment (relative to the observed frequency that youth experienced this outcome [40.7%]). Model sensitivity and PPV improved when thresholds for clinical significance included smaller changes in HbA1c, whereas specificity and negative predictive value improved when thresholds required larger changes in HbA1c.

Conclusions:

Routinely collected EHR data can be used to create an ML model for predicting unit-change in HbA1c between diabetes clinic visits among youth with T1D. Future work will focus on optimizing model performance and validating the model in additional cohorts and in other diabetes clinics.

Citation

Please cite as:

Tallon EM, Williams DD, Schweisberger C, Mullaney C, Lockee B, Ferro D, Vandervelden CA, Barnes MS, Sarteau AC, Kahkoska AR, Patton SR, Mehta S, McDonough R, Lind M, D'Avolio L, Clements MA

Toward a Clinically Actionable, Electronic Health Record–Based Machine Learning Model to Forecast 90-Day Change in Hemoglobin A1c in Youth With Type 1 Diabetes: Feasibility and Model Development Study

JMIR Diabetes 2025;10:e69142

DOI: 10.2196/69142

PMID: 40997334

PMCID: 12463387

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Diabetes

Date Submitted: Jan 10, 2025

Open Peer Review Period: Jan 22, 2025 - Mar 19, 2025

Date Accepted: Jul 23, 2025

(closed for review but you can still tweet)

Towards a clinically actionable, electronic health record-based machine learning model to forecast 90-day change in hemoglobin A1c in youth with type 1 diabetes: Feasibility and model development study

ABSTRACT

Citation

Copyright