Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Oct 31, 2019
Date Accepted: Feb 21, 2020

The final, peer-reviewed published version of this preprint can be found here:

An Informatics Framework to Assess Consumer Health Language Complexity Differences: Proof-of-Concept Study

Yu B, He Z, Xing A, Lustria ML

An Informatics Framework to Assess Consumer Health Language Complexity Differences: Proof-of-Concept Study

J Med Internet Res 2020;22(5):e16795

DOI: 10.2196/16795

PMID: 32436849

PMCID: 7273233

An Informatics Framework to Assess Consumer Health Language Complexity Differences: A Proof-of-Concept Study

  • Biyang Yu; 
  • Zhe He; 
  • Aiwen Xing; 
  • Mia Liza Lustria

ABSTRACT

Background:

The language gap between health consumers and health professionals has been long recognized as the hindrance for effective health information comprehension. Although providing health information access in consumer health language is widely accepted as the solution to the problem, health consumers were found to have various health language preferences and proficiencies. To adaptively simplify health documents for heterogeneous consumer groups, it is important to quantify how consumer health languages are different in terms of complexity among various consumer groups.

Objective:

This study proposes a measurement tool (CHELCS, Consumer Health Language Complexity Score) to quantify the complexity of consumer health language (CHL) using syntax-level, text-level, term-level, and semantic-level complexity measurements. Specifically, we used CHELCS to compare posts of each individual in online health forums designed for: (a) the general public, (b) D/deaf and hard of hearing (D/hh) people, and for (3) people with autism spectrum disorder (ASD).

Methods:

Objective:

This study proposes an informatics framework (Consumer Health Language Complexity, CHELC) to assess the complexity differences of consumer health language (CHL) using syntax-level, text-level, term-level, and semantic-level complexity metrics. Specifically, we identified eight language complexity metrics validated in previous literature and combined them into a four-faceted framework. Through rank-based algorithm, we developed unifying scores (Consumer Health Language Complexity Scores, CHELCS) to quantify syntax-level (CHELCSsyntax), text-level (CHELCStext), term-level (CHELCSterm), semantic-level (CHELCSsemantic), and overall CHL complexity (CHELCSoverall). We applied CHELCS to compare posts of each individual in online health forums designed for: (a) the general public, (b) D/deaf and hard of hearing (D/hh) people, and for (3) people with autism spectrum disorder (ASD).

Results:

The results suggest that participants in the public forum used more complex CHL, particularly more diverse semantics and more complex health terms compared to those participating in the ASD and D/hh forums. However, between the latter two, ASD users used more complex words, and D/hh users used more complex syntax.

Conclusions:

Our results showed that the users in the three online forums had significantly different CHL complexities in different facets. The proposed framework and detailed measurements helped to comprehensively quantify these CHL complexity differences. The results emphasize the importance of tailoring health content for different consumer groups with varying CHL complexities.


 Citation

Please cite as:

Yu B, He Z, Xing A, Lustria ML

An Informatics Framework to Assess Consumer Health Language Complexity Differences: Proof-of-Concept Study

J Med Internet Res 2020;22(5):e16795

DOI: 10.2196/16795

PMID: 32436849

PMCID: 7273233

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.