Accepted for/Published in: JMIR Medical Informatics
Date Submitted: May 11, 2025
Date Accepted: Oct 6, 2025
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Development and Validation of a Machine Learning-Based Fall Prediction Model for Hospitalized Patients
ABSTRACT
Background:
Falls among hospitalized patients are a critical issue which often leads to prolonged hospital stays and increased healthcare costs. Traditional fall risk assessments typically rely on standardized scoring systems; however, these may fail to capture the complex and multifactorial nature of fall risk factors.
Objective:
this multicenter cohort study aimed to develop and validate a machine learning-based model to predict in-hospital falls and to evaluate its performance in terms of both discrimination and calibration.
Methods:
We analyzed the data of 83,917 inpatients aged ≥ 65 years with a hospital stay of at least three days. Using Diagnosis Procedure Combination data and laboratory results, we extracted demographic, clinical, functional, and pharmacological variables. Following the selection of 30 key features, four predictive models were constructed: logistic regression, extreme gradient boosting, light gradient boosting machine (LGBM), and categorical boosting (CatBoost). The synthetic minority oversampling technique and isotonic regression calibration were applied to improve the prediction quality and address class imbalance.
Results:
Among the models, CatBoost demonstrated the highest F1 score and PRAUC, whereas LGBM exhibited the best calibration, closely approximating ideal performance. In addition, SHAP analysis identified frailty-related factors (e.g., albumin level and mobility) as well as the use of sedatives, hypnotics, and diabetes medications as significant factors that contribute to fall risk. Detailed incident report analyses further indicated that toileting-related falls were particularly common, especially during late-night and early-morning near the bedside.
Conclusions:
These findings highlight the value of integrating machine-learning models with comprehensive patient assessments to enhance fall risk prediction. By focusing on individualized patient conditions and implementing targeted strategies for safe toileting and mobility, more precise and effective fall prevention measures can be developed for hospitalized patients. Clinical Trial: Not applicable.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.