Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Aug 1, 2022
Date Accepted: Apr 8, 2023

The final, peer-reviewed published version of this preprint can be found here:

Using the H2O Automatic Machine Learning Algorithms to Identify Predictors of Web-Based Medical Record Nonuse Among Patients in a Data-Rich Environment: Mixed Methods Study

Zhou Q, Chen Y, Liu X, Zhu M, Shia BC, Chen MC, Ye L, Qin L

Using the H2O Automatic Machine Learning Algorithms to Identify Predictors of Web-Based Medical Record Nonuse Among Patients in a Data-Rich Environment: Mixed Methods Study

JMIR Med Inform 2023;11:e41576

DOI: 10.2196/41576

PMID: 37335616

PMCID: 10337515

Identifying Predictors of Non-Use of Online Medical Records Among Patients in a Data-Rich Environment by H2O’s Auto-Machine Learning

  • Qian Zhou; 
  • Yang Chen; 
  • Xuejiao Liu; 
  • Miao Zhu; 
  • Ben-Chang Shia; 
  • Ming-Chih Chen; 
  • Linglong Ye; 
  • Lei Qin

ABSTRACT

Background:

With the advent of electronic storage of medical records and the Internet, patients can access online medical records. This has facilitated doctor–patient communication and built trust between them.

Objective:

Based on demographic and individual behavioral characteristics, this study explores the predictors of and the reasons for online medical record nonuse.

Methods:

Data were collected from the National Cancer Institute’s 2019–2020 Health Information National Trends Survey. First, based on the data-rich environment, the chi-square test (categorical variable) and two-tailed t-tests (continuous variable) were performed on the response variables and the variables in the questionnaire. According to the test results, the variables were initially screened and those that passed the test were selected. Second, the samples with missing values in the initially screened variables were deleted. Finally, the data obtained were modeled using five machine learning algorithms, namely logistic regression, automatic generalized linear model, automatic random forests, automatic deep neural networks, and automatic gradient boosting machine, to identify and investigate factors affecting online medical record nonuse.

Results:

Automatic machine learning methods have a high prediction accuracy. Based on the validation dataset fitting performance, the optimal model was auto-random forests with an AUC value of 88.90%. Using the five models, 28 variables were identified as crucial predictors of nonuse of online medical records. They consisted of six sociodemographic variables and 22 variables related to individual lifestyles and behavioral habits.

Conclusions:

When monitoring online medical record usage trends, research should focus on social factors such as age, education, BMI, marital status, as well as personal lifestyle and behavioral habits, including smoking, use of electronic devices and Internet, patients’ personal health status, and their level of health concern wait. The use of electronic medical records can be promoted to specific groups of people in a targeted manner, so that the usefulness of medical records can be promoted in more groups of people.


 Citation

Please cite as:

Zhou Q, Chen Y, Liu X, Zhu M, Shia BC, Chen MC, Ye L, Qin L

Using the H2O Automatic Machine Learning Algorithms to Identify Predictors of Web-Based Medical Record Nonuse Among Patients in a Data-Rich Environment: Mixed Methods Study

JMIR Med Inform 2023;11:e41576

DOI: 10.2196/41576

PMID: 37335616

PMCID: 10337515

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.