Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 4, 2021
Date Accepted: Mar 30, 2021

The final, peer-reviewed published version of this preprint can be found here:

Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study

Ji M, Liu Y, Zhao M, Lyu Z, Zhang B, Luo X, Li Y, Zhong Y

Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study

JMIR Med Inform 2021;9(5):e28413

DOI: 10.2196/28413

PMID: 33955834

PMCID: 8138706

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Predicting Health Educational Material Understandability using Machine Learning Algorithms

  • Meng Ji; 
  • Yanmeng Liu; 
  • Mengdan Zhao; 
  • Ziqing Lyu; 
  • Boren Zhang; 
  • Xin Luo; 
  • Yanlin Li; 
  • Yin Zhong

ABSTRACT

Background:

Improving the understandability of health information can significantly increase the cost-effectiveness and efficiency of health education programs for vulnerable populations. There is a pressing need to develop clinically informed computerized tools to enable rapid, reliable assessment of the linguistic understandability of specialized health and medical education resources.

Objective:

This paper fills a critical gap in current patient-oriented health resource development, which requires reliable, accurate evaluation instruments to increase the efficiency, cost-effectiveness of health education resource evaluation. We aim to translate internationally endorsed clinical guidelines, Patient Education Materials Assessment Tool (PEMAT) to machine learning algorithms to facilitate the evaluation of the understandability of health resources for international students at Australian universities.

Methods:

Based on international patient health resource assessment guidelines, we developed machine learning algorithms to predict the linguistic understandability of health texts for Australian college students (aged 25-30) from non-English speaking backgrounds. We compared extreme gradient boosting, random forest, neural networks, C5 decision tree for automated health information understandability evaluation. The five machine learning models achieved statistically better results compared to the baseline logistic regression model. We also evaluated the impact of each linguistic feature on the performance of each of the five models.

Results:

It was found that information evidentness, relevance to educational purposes and logical sequence were consistently more important than numeracy skills and medical knowledge when assessing the linguistic understandability of health education resources for international tertiary students with adequate English skills (IELT test score mean 6.5) and high health literacy (mean 16.5 in the Short Assessment of Health Literacy-English test). The results challenged traditional views that lack of medical knowledge and numerical skills constituted the barriers to the understanding of health educational materials.

Conclusions:

Machine learning algorithms were developed to predict health information understandability for international college students aged 25-30. 13 natural language features and 5 evaluation dimensions were identified and compared in terms of their impact on the performance of the models. Health information understandability varies according to the demographic profiles of the target readers, and for international tertiary students, improving health information evidentness, relevance and logic is critical.


 Citation

Please cite as:

Ji M, Liu Y, Zhao M, Lyu Z, Zhang B, Luo X, Li Y, Zhong Y

Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study

JMIR Med Inform 2021;9(5):e28413

DOI: 10.2196/28413

PMID: 33955834

PMCID: 8138706

The author of this paper has made a PDF available, but requires the user to login, or create an account.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.