Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Feb 14, 2023
Date Accepted: Aug 23, 2023

The final, peer-reviewed published version of this preprint can be found here:

Predicting the Risk of Sleep Disorders Using a Machine Learning–Based Simple Questionnaire: Development and Validation Study

Ha S, Choi SJ, Lee S, Wijaya RH, Kim JH, Joo EY, Kim JK

Predicting the Risk of Sleep Disorders Using a Machine Learning–Based Simple Questionnaire: Development and Validation Study

J Med Internet Res 2023;25:e46520

DOI: 10.2196/46520

PMID: 37733411

PMCID: 10557018

Predicting the Risk of Sleep Disorders using a Machine Learning-Based Simple Questionnaire: Development and Validation Study

  • Seokmin Ha; 
  • Su Jung Choi; 
  • Sujin Lee; 
  • Reinatt Hansel Wijaya; 
  • Jee Hyun Kim; 
  • Eun Yeon Joo; 
  • Jae Kyoung Kim

ABSTRACT

Background:

Sleep disorders, such as insomnia, obstructive sleep apnea (OSA), and comorbid insomnia and sleep apnea (COMISA), are common and can have serious health consequences. However, accurately diagnosing these conditions can be challenging as overnight polysomnography (PSG), the gold standard for diagnosis, is expensive, time-consuming, and cumbersome.

Objective:

We aim to develop a machine learning algorithm that can accurately predict the risk of OSA, COMISA, and insomnia with a simple set of questions, without the need for a PSG test.

Methods:

We applied XGBoost (eXtreme Gradient Boosting) to the data from two medical centers (N=4,257 from Samsung Medical Center and N=365 from Ewha Womans University Medical Center Seoul Hospital). Features were selected based on feature importance calculated by the SHapley Additive exPlanations (SHAP) method. We applied XGBoost using selected features to develop SLEEPS (SimpLe quEstionnairE Predicting Sleep disorders). The accuracy of the algorithm was evaluated using the area under the receiver operating characteristics curve (AUROC).

Results:

Nine features were selected to construct SLEEPS. SLEEPS showed high accuracy, with an AUROC of greater than 0.897 for all three sleep disorders, and consistent performance across both sets of data. We found that the distinction between COMISA and OSA was critical for accurate prediction. A publicly accessible website was created based on the algorithm that provides predictions for the risk of the three sleep disorders and shows how the risk changes with changes in weight or age.

Conclusions:

SLEEPS has the potential to improve the diagnosis and treatment of sleep disorders by providing more accessibility and convenience. The creation of a publicly accessible website based on the algorithm provides a user-friendly tool for assessing the risk of OSA, COMISA, and insomnia.


 Citation

Please cite as:

Ha S, Choi SJ, Lee S, Wijaya RH, Kim JH, Joo EY, Kim JK

Predicting the Risk of Sleep Disorders Using a Machine Learning–Based Simple Questionnaire: Development and Validation Study

J Med Internet Res 2023;25:e46520

DOI: 10.2196/46520

PMID: 37733411

PMCID: 10557018

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.