Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Jun 2, 2024
Date Accepted: Nov 22, 2024
A Practical web tool for Evaluation of Mortality Risk in Anti-MDA5+DM-ILD Patients Using Multi-center Retrospective Analysis in China: Algorithm Development and Validation
ABSTRACT
Background:
Patients with anti-melanoma differentiation-related gene 5 antibody positive dermatomyositis-associated interstitial (MDA5+DM-ILD) were prone to rapidly progressive interstitial lung disease (RP-ILD) and high mortality. Therefore, a validated prediction model for mortality evaluation using a concise and practicable web-based tool needs to be established.
Objective:
To develop and validate a risk prediction model of 3-month mortality using machine learning in a large multi-center cohort of patients with anti-MDA5+DM-ILD patients in China.
Methods:
Consecutive 609 patients with anti-MDA5+DM-ILD were enrolled retrospectively from 6 hospitals in China. Patient demographics and laboratory and clinical parameters were collected on admission. The primary endpoints were the 3-month mortality for all causes. Six machine learning algorithms including extreme gradient boosting, logistic regression, Light Gradient Boosting Machine, random forest, support vector machine and k-nearest neighbor algorithm were applied to construct the model and evaluate the model using k-fold cross validation method, ROC curve, calibration curve, decision curve analysis (DCA) and external validation.
Results:
A total of 509 patients from 6 institutions in China were finally included in the training cohort (n =203), an internal validation cohort (n=51) and an external validation cohort 1 (n=92) and cohort 2 (n=163). respectively. By machine learning, eight important variables including RP-ILD, erythrocyte sedimentation rate, serum albumin, age, C-reactive protein level, aspartate aminotransferase, lactate dehydrogenase, ratio of the absolute neutrophil count and the absolute lymphocyte count, were screened out and identified as for model construction. Compared with different machine learning approaches, logistic regression finally was selected as optimal model construction algorithm. The prediction model constructed by logistic regression achieved the best performance with an AUC, sensitivity, and specificity of 0.866,84.8%, and 84.4% on the validation set and 0.90, 85.0%, and 83.9% on the test set, respectively. The calibration curve and decision curve analyses showed satisfactory consistency and clinical utility of this prediction model. Furthermore, the optimal logistic model achieved encouraging predictive performance in the external validation cohort 1 (AUC 0.836, 95% CI, 0.754–0.916), and external validation cohort 2 (AUC 0.915, 95% CI, 0.871–0.959), indicating that the prediction model had better extrapolation.
Conclusions:
A robust clinical model and a web tool for predicting the risk of 3-month mortality of anti-MDA5+DM-ILD patients was successfully developed.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.