JMIR Preprints #82635: Predictors of glycaemic response to sulphonylurea therapy in type 2 diabetes: A comparative analysis of linear regression and machine learning models

Predictors of glycaemic response to sulphonylurea therapy in type 2 diabetes: A comparative analysis of linear regression and machine learning models

Shilpa Garg;
Robert Kitchen;
Ramneek Gupta;
Emanuele Trucco;
Ewan Pearson

ABSTRACT

Background:

Sulphonylureas are commonly prescribed for managing type 2 diabetes, yet treatment responses vary substantially among individuals. Although advances in machine learning (ML) may enhance predictive capabilities compared to traditional statistical methods, their practical utility in real-world clinical environments remains uncertain.

Objective:

This study aimed to evaluate and compare the predictive performance of linear regression models with several ML approaches for predicting glycaemic response to sulphonylurea therapy using routine clinical data.

Methods:

A cohort of 7,557 individuals with type 2 diabetes who initiated sulphonylurea therapy was analysed, with all patients followed for one year. Linear and logistic regression models were used as baseline comparisons. A range of ML models was trained to predict the continuous change in HbA1c levels and the achievement of HbA1c <58 mmol/mol at follow-up. These models included Random Forest, XGBoost, Support Vector Machines (SVM), a conventional feedforward neural network (NN), and Bayesian Additive Regression Trees (BART). Model performance was assessed using standard metrics including R² and RMSE for regression tasks, and AUROC for classification.
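The three metrics named above can be sketched in a few lines of plain Python. This is an illustrative sketch only: the abstract does not state which software the authors used, the function names (`r_squared`, `rmse`, `auroc`) are hypothetical, and the toy values are invented for demonstration and are not study data. AUROC is computed here with the pairwise (Mann-Whitney) formulation, which is one standard way to obtain it.

```python
import math


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def rmse(y_true, y_pred):
    """Root mean squared error of the predictions."""
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def auroc(labels, scores):
    """Probability that a random positive case scores above a random
    negative case, counting ties as one half (Mann-Whitney formulation)."""
    pos = [s for lab, s in zip(labels, scores) if lab == 1]
    neg = [s for lab, s in zip(labels, scores) if lab == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy values (assumed, not from the study): a continuous outcome such as
# change in HbA1c, and a binary outcome such as reaching the HbA1c target.
observed_change = [1.0, 2.0, 3.0, 4.0]
predicted_change = [1.5, 2.0, 2.5, 4.0]
reached_target = [0, 0, 1, 1]
predicted_probability = [0.1, 0.4, 0.35, 0.8]

print(r_squared(observed_change, predicted_change))
print(rmse(observed_change, predicted_change))
print(auroc(reached_target, predicted_probability))
```

Applying the same functions to each model's held-out predictions gives directly comparable figures, which is the kind of like-for-like comparison the study describes.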

Results:

All models exhibited similar performance, with no significant advantage of ML techniques over linear regression. For continuous outcomes, BART demonstrated the highest R² (0.445) and lowest RMSE (0.105), though differences among models were minimal. For the binary outcome, XGBoost achieved the highest AUROC (0.712), with confidence intervals overlapping those of other models. Across all models, baseline HbA1c was consistently the primary predictor, explaining the majority of the variance. Sensitivity analyses and hyperparameter tuning did not significantly improve model performance.

Conclusions:

The findings suggest that, in this real-world cohort, ML models did not outperform traditional regression in predicting glycaemic response to sulphonylureas. The limited improvement of ML over linear and logistic regression may reflect a lack of strong non-linear effects or interactions among predictors in the available clinical data. It is also possible that the clinical features used do not capture sufficient biological heterogeneity to leverage the strengths of complex modelling techniques.

Citation

Please cite as:

Garg S, Kitchen R, Gupta R, Trucco E, Pearson E

Predictors of Glycemic Response to Sulfonylurea Therapy in Type 2 Diabetes Over 12 Months: Comparative Analysis of Linear Regression and Machine Learning Models

JMIR Diabetes 2026;11:e82635

DOI: 10.2196/82635

PMID: 41650391

PMCID: 12880802

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Diabetes

Date Submitted: Aug 19, 2025

Date Accepted: Dec 11, 2025
