Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Mar 9, 2022
Date Accepted: Jul 28, 2022

The final, peer-reviewed published version of this preprint can be found here:

Web-Based Risk Prediction Tool for an Individual's Risk of HIV and Sexually Transmitted Infections Using Machine Learning Algorithms: Development and External Validation Study

Xu X, Yu Z, Ge Z, Chow E, Bao Y, Ong J, Li W, Wu J, Fairley C, Zhang L

Web-Based Risk Prediction Tool for an Individual's Risk of HIV and Sexually Transmitted Infections Using Machine Learning Algorithms: Development and External Validation Study

J Med Internet Res 2022;24(8):e37850

DOI: 10.2196/37850

PMID: 36006685

PMCID: 9459839

Development and external validation of a web-based risk prediction tool using machine learning algorithms for an individual's risk of HIV and sexually transmitted infections

  • Xianglong Xu; 
  • Zhen Yu; 
  • Zongyuan Ge; 
  • Eric Chow; 
  • Yining Bao; 
  • Jason Ong; 
  • Wei Li; 
  • Jinrong Wu; 
  • Christopher Fairley; 
  • Lei Zhang

ABSTRACT

Background:

HIV and sexually transmitted infections (STI) are major global public health concerns. Over one million curable STIs occur every day amongst people aged 15–49 years worldwide. Insufficient testing or screening substantially impedes the elimination of HIV/STI transmission.

Objective:

The aim of our study is to develop an HIV/STI risk prediction tool using machine learning algorithms.

Methods:

We used clinic consultations tested for HIV/STI at the Melbourne Sexual Health Centre between March 2, 2015, to December 31, 2018, as the development dataset (training and testing dataset). We also used two external validation datasets including data in 2019 as the external 'validation data 1' and data during 01/2020 and 01/2021 as the external 'validation data 2'. We developed 34 machine learning models to assess the risk of acquiring HIV, syphilis, gonorrhoea, and chlamydia. We created an online tool to generate an individual's risk of HIV/STI.

Results:

The important predictors for HIV/STIs risk were gender, age, men who reported having sex with men, number of casual sexual partners, and condom use. Our ML-based risk prediction tool named MySTIRisk performed at an acceptable or excellent level on testing datasets (area under the curve (AUC) for HIV= 0.78; syphilis = 0.84; gonorrhoea =0.78; chlamydia =0.70) which had stable performance on both external validation data in 2019 (AUC for HIV= 0.79; syphilis = 0.85; gonorrhoea = 0.81; chlamydia = 0.69), and data in 2020-2021(AUC for HIV= 0.71; syphilis= 0.84; gonorrhoea =0.79; chlamydia =0.69).

Conclusions:

Our web-based risk prediction tool could accurately predict the risk of HIV/STI in clinic attendees with simple self-reported questions. MySTIRisk could serve as an HIV/STI screening tool in clinic websites or digital health platforms to encourage individuals at risk of HIV/STI to have testing or start HIV pre-exposure prophylaxis. The public can use this tool to assess risk and then decide if they would attend a clinic for testing. Clinicians or public health workers can use this tool to identify high-risk individuals for risk management or further interventions.


 Citation

Please cite as:

Xu X, Yu Z, Ge Z, Chow E, Bao Y, Ong J, Li W, Wu J, Fairley C, Zhang L

Web-Based Risk Prediction Tool for an Individual's Risk of HIV and Sexually Transmitted Infections Using Machine Learning Algorithms: Development and External Validation Study

J Med Internet Res 2022;24(8):e37850

DOI: 10.2196/37850

PMID: 36006685

PMCID: 9459839

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.