Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Public Health and Surveillance

Date Submitted: Jul 17, 2022
Date Accepted: Nov 30, 2022

The final, peer-reviewed published version of this preprint can be found here:

Predicting Risky Sexual Behavior Among College Students Through Machine Learning Approaches: Cross-sectional Analysis of Individual Data From 1264 Universities in 31 Provinces in China

Li X, Zhang H, Zhao S, Tang K

Predicting Risky Sexual Behavior Among College Students Through Machine Learning Approaches: Cross-sectional Analysis of Individual Data From 1264 Universities in 31 Provinces in China

JMIR Public Health Surveill 2023;9:e41162

DOI: 10.2196/41162

PMID: 36696166

PMCID: 9909517

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Predicting risky sexual behavior among college students through machine learning approaches: Analysis of individual data from 1264 universities in 31 provinces in China

  • Xuan Li; 
  • Hanxiyue Zhang; 
  • Shuangyu Zhao; 
  • Kun Tang

ABSTRACT

Background:

Risky sexual behavior (RSB), as the most direct risk factor for sexually transmitted infections (STIs), is common among college students. Thus, it is important to intervene and prevent it among college students by identifying relevant risk factors and making predictions.

Objective:

We aimed to establish a predictive model for RSB among college students to facilitate timely prevention and intervention before contraction of STIs.

Methods:

We included a total of 8,290 self-reported heterosexual Chinese students with sexual intercourse experience from November 2019 to February 2020. We identified RSB among those students and attributed it to four dimensions: whether contraception was used; whether the contraceptive method was safe; whether students engaged in casual sex or sex with multiple partners; and integrated RSB, which combined the first three dimensions. For each type, we compared various machine learning (ML) models according to multiple validation indicators and chose the optimal model for both RSB prediction and risk factor identification.

Results:

In total, 4993 (60·2%) students had ever engaged in RSB. Among them, 3422 (41·3%) did not use contraception every time they had sexual intercourse, 3393 (40·93%) had ever used an unsafe contraceptive method, and 1069 (12·9%) had casual sex or sex with multiple partners. Through comparison, the XGBoost (XGB) and gradient boosting machine (GBM) models achieved the optimal predictive performance on integrated RSB, with an area under the receiver operator characteristic curve (AUC) reaching 0·80. Under the condition of ensuring the stability of various validation indicators, the 12 most predictive variables were finally selected by XGB, including participants’ relationship status, sexual knowledge, sexual attitude, and previous sexual experience.

Conclusions:

RSB is prevalent among college students, and ML is an effective approach to predict RSB and identify corresponding risk factors.


 Citation

Please cite as:

Li X, Zhang H, Zhao S, Tang K

Predicting Risky Sexual Behavior Among College Students Through Machine Learning Approaches: Cross-sectional Analysis of Individual Data From 1264 Universities in 31 Provinces in China

JMIR Public Health Surveill 2023;9:e41162

DOI: 10.2196/41162

PMID: 36696166

PMCID: 9909517

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.