Accepted for/Published in: JMIR Formative Research
Date Submitted: Apr 27, 2021
Date Accepted: Jan 25, 2022
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Machine-aided Noninvasive Self-diagnostic Tool for Polycystic Ovary Syndrome: Observational Study
ABSTRACT
Background:
With the prolonged period of the Covid-19 global pandemic, artificial intelligent and digital health care have substantially advanced to improve and enhance diagnosis and treatment. In this study, we discuss development of a digital prediction tool for diagnosis of Polycystic Ovary Syndrome (PCOS) using machine learning techniques.
Objective:
We aim to develop a self-diagnostic prediction digital platform for Polycystic Ovary Syndrome based on noninvasive measures such as age, lifestyle, and anthropomorphic measures and symptoms, that do not require any lab or ultrasound results.
Methods:
In this retrospective study, a publicly available dataset of 541 women’s health information collected in Kerala, India, including PCOS status was acquired and used for analysis. Principal component analysis and K-means clustering are adopted to classify the sample into four subgroups based on anthropomorphic measures. The prediction for PCOS based on noninvasive measures is made on each subgroup using random forest classifiers and the prediction errors are estimated. Important predictors for diagnosing PCOS are identified for each subgroup.
Results:
By adopting subgroup models, we substantially improve the prediction error rates by 11.84% across the subgroups compared to using one model to the entire sample. The mean precision, sensitivity, accuracy and F1-score for the diagnosis of PCOS using the proposed subgroup models are 0.950, 0.972, 0.940 and 0.961, respectively. No invasive measures are used in this prediction.
Conclusions:
Anthropomorphic measures are found to be important variables to classify the women into subgroups. The results of the proposed subgroup models suggest that more accurate prediction for the diagnosis of PCOS can be made when different models are used for different subgroups rather than when a single model is used for the whole sample. This work enables women to conveniently access the proposed self-diagnosis prediction platform at home without delay before they seek for further medical care. Clinical Trial: This is an observational study.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.