Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Nov 5, 2024
Date Accepted: Feb 5, 2025

The final, peer-reviewed published version of this preprint can be found here:

Two-Year Hypertension Incidence Risk Prediction in Populations in the Desert Regions of Northwest China: Prospective Cohort Study

Yinlin C, Kuiying G, Weidong J, Zhengsheng H, Yining Y, Yi Z

Two-Year Hypertension Incidence Risk Prediction in Populations in the Desert Regions of Northwest China: Prospective Cohort Study

J Med Internet Res 2025;27:e68442

DOI: 10.2196/68442

PMID: 40072485

PMCID: 11947627

Two-Year Hypertension Incidence Risk Prediction: A Prospective Cohort Study in the Desert Regions of Northwest China

  • Cheng Yinlin; 
  • Gu Kuiying; 
  • Ji Weidong; 
  • Hu Zhengsheng; 
  • Yang Yining; 
  • Zhou Yi

ABSTRACT

Background:

Hypertension (HTN) is a major global health issue and a significant modifiable risk factor for cardiovascular diseases (CVD), contributing to a substantial socioeconomic burden due to its high prevalence. In China, particularly among populations living near desert regions, hypertension is even more prevalent due to unique environmental and lifestyle conditions, exacerbating the disease burden in these areas.

Objective:

This study aims to develop and validate a machine learning-based model for predicting the risk of developing hypertension within a two-year period.

Methods:

The study utilized health examination data collected between 2019 and 2023 from populations in four regions surrounding the Taklamakan Desert of China. A retrospective cohort (n=1,038,170) and a prospective cohort (n=961,519) were constructed. Four machine learning algorithms and two deep learning methods were employed, with Bayesian optimization used to fine-tune the hyperparameters of those models.

Results:

The two-year incidence rate of hypertension was found to be 15.20% (n=157,766). Among the models developed, the CatBoost model demonstrated the best performance, achieving AUC values of 0.888 (95% CI: 0.886-0.889) in the retrospective cohort and 0.803 (95% CI: 0.801-0.804) in the prospective cohort. Additionally, a web-based prediction tool based on this model was developed to facilitate its application in clinical practice.

Conclusions:

The machine learning model developed in this study effectively predicts the two-year risk of hypertension, making it particularly suitable for preventive health management in high-risk populations in desert regions. The model exhibits excellent predictive performance and has potential for clinical application. With the development of a web-based application, this model can be widely implemented in public health and personalized medicine, supporting early intervention and prevention of hypertension.


 Citation

Please cite as:

Yinlin C, Kuiying G, Weidong J, Zhengsheng H, Yining Y, Yi Z

Two-Year Hypertension Incidence Risk Prediction in Populations in the Desert Regions of Northwest China: Prospective Cohort Study

J Med Internet Res 2025;27:e68442

DOI: 10.2196/68442

PMID: 40072485

PMCID: 11947627

The author of this paper has made a PDF available, but requires the user to login, or create an account.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.