Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Sep 11, 2019
Date Accepted: Jan 24, 2020

The final, peer-reviewed published version of this preprint can be found here:

Comparisons Between Hypothesis- and Data-Driven Approaches for Multimorbidity Frailty Index: A Machine Learning Approach

Peng LN, Hsiao FY, Lee WJ, Huang ST, Chen LK

Comparisons Between Hypothesis- and Data-Driven Approaches for Multimorbidity Frailty Index: A Machine Learning Approach

J Med Internet Res 2020;22(6):e16213

DOI: 10.2196/16213

PMID: 32525481

PMCID: 7317629

Machine-Learning Multimorbidity Frailty Index: Comparisons between Hypothesis- and Data-Driven Approaches Using Taiwan’s National Health Insurance Research Database

  • Li-Ning Peng; 
  • Fei-Yuan Hsiao; 
  • Wei-Ju Lee; 
  • Shih-Tsung Huang; 
  • Liang-Kung Chen

ABSTRACT

Background:

The theory of cumulative deficits using big data to develop the multimorbidity frailty index (mFI) has become a widely accepted approach in public health and healthcare services. However, constructing the mFI using the most critical determinants and stratifying different risk groups with dose-response relationships remain major challenges in clinical practice.

Objective:

The final ML-mFI model contained 38 diseases/deficits in this study. Compared with conventional mFI, both indices had similar distribution patterns by age and sex; however, among people aged 65-69, the mean mFI and ML-mFI were 0.037 (standard deviation (SD) 0.048) and 0.0070 (SD 0.0254), respectively. The difference may result from discrepancies in the diseases/deficits selected in the mFI and the ML-mFI. A total of 86,133 subjects aged 65 to 100 years were included in this study and were categorized into 4 groups according to the level of the ML-mFI. Both the Kaplan-Meier survival curves and Cox models showed that the ML-mFI significantly predicted all outcomes of interest, including all-cause mortality, unplanned hospitalizations and all-cause ICU admissions, at 1, 5 and 8 years of follow-up (P<0.01). In particular, a dose-response relationship was revealed between the four ML-mFI groups and adverse outcomes.

Methods:

In this study, we used Taiwan’s National Health Insurance Research Database to develop machine learning multimorbidity frailty index (ML-mFI) using the theory of cumulative diseases/deficits of individual older person. Compared to conventional mFI by selecting diseases/deficits based on expert opinions, we adopted random forest to select most influential diseases/deficits to predict adverse outcomes for older people. To ensure the survival curves showed dose-responsive relationship with overlapping during the follow-up, we developed Distance Index and Coverage Index at any time point to categorize the ML-mFI of all subjects into robust, mild frailty, moderate frailty and severe frailty. Survival analysis was conducted to evaluate the ability of the ML-mFI in predicting adverse outcomes, such as unplanned hospitalizations, intensive care unit (ICU) admissions and mortality.

Results:

The final ML-mFI model contained 38 diseases/deficits in this study. Compared with conventional mFI, both indices had similar distribution patterns by age and sex; however, among people aged 65-69, the mean mFI and ML-mFI were 0.037 (standard deviation (SD) 0.048) and 0.0070 (SD 0.0254), respectively. The difference may be resulted from discrepancies of diseases/deficit selected in mFI and ML-mFI. A total of 86,133 subjects aged 65 to 100 years were included in this study and categorized into 4 groups according to the level of ML-mFI. Both the Kaplan-Meier survival curves and Cox models showed that ML-mFI significantly predicted all outcomes of interests, including all-cause mortality, unplanned hospitalization and all-cause ICU admission, among 1-, 5- and 8- years of follow-up. Particularly, a dose-responsive relationship were revealed between the four ML-mFI groups and adverse outcomes.

Conclusions:

The ML-mFI consists of 38 diseases/deficits that can successfully stratify risk groups associated with all-cause mortality, unplanned hospitalizations and all-cause ICU admissions in older people, which indicates that precise, patient-centered medical care can be a reality in an aging society.


 Citation

Please cite as:

Peng LN, Hsiao FY, Lee WJ, Huang ST, Chen LK

Comparisons Between Hypothesis- and Data-Driven Approaches for Multimorbidity Frailty Index: A Machine Learning Approach

J Med Internet Res 2020;22(6):e16213

DOI: 10.2196/16213

PMID: 32525481

PMCID: 7317629

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.