Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Feb 20, 2025
Date Accepted: Apr 17, 2025

The final, peer-reviewed published version of this preprint can be found here:

The Use of Natural Language Processing to Interpret Unstructured Patient Feedback on Health Services: Scoping Review

Feizollah A, Lin CY, O'malley L, Thompson W, Listl S, Byrne M

The Use of Natural Language Processing to Interpret Unstructured Patient Feedback on Health Services: Scoping Review

J Med Internet Res 2025;27:e72853

DOI: 10.2196/72853

PMID: 40811803

PMCID: 12352804

The use of natural language processing to interpret unstructured patient feedback on health services: A scoping review

  • Ali Feizollah; 
  • Chiu-Yi Lin; 
  • Lucy O'malley; 
  • Wendy Thompson; 
  • Stefan Listl; 
  • Matthew Byrne

ABSTRACT

Background:

Unstructured patient feedback (UPF) allows patients to freely express their experiences without the constraints of predefined questions. The proliferation of online healthcare rating websites has created a vast source of UPF. Natural language processing (NLP) techniques, particularly sentiment analysis and topic modelling, are increasingly being used to analyse UPF in healthcare settings, however the scope and clinical relevance of these technologies is unclear.

Objective:

This scoping review investigates how NLP techniques are being used to interpret UPF, with focus on the healthcare settings in which this is used, the purposes for using these technologies, and any impacts reported on clinical practice.

Methods:

Searches of the MEDLINE, EMBASE, CINAHL, Cochrane Database of Reviews, and Google Scholar were conducted in February 2024. No date limits were applied. English language studies that used NLP techniques on UPF that pertained to an identifiable health care setting or provider were included. Data extraction focused on the healthcare setting, NLP methods used, and applications of these techniques.

Results:

52 studies were included. NLP was most commonly applied to UPF from secondary care settings (n=33) with fewer in primary (n=10) or community (n=5) care. Three NLP techniques were identified in the included studies: sentiment analysis (n=32), topic modelling (n=15) and text classification (n=7). Sentiment analysis was applied to explore associations between patient sentiment and healthcare provider characteristics, track emotional responses over time, and identify areas for improvement in healthcare delivery. Topic modelling, primarily using Latent Dirichlet Allocation (LDA) algorithm, was employed to uncover latent themes in patient feedback, compare patient experiences across different healthcare settings, and track changes in patient concerns over time. Text classification was used to categorize patient feedback into predefined topics. The association between NLP-derived insights and traditional healthcare quality metrics was limited, with few studies describing concrete clinical impacts resulting from their analyses.

Conclusions:

NLP has been applied to UPF across a number of contexts, primarily to identify features of health services or professionals that support good patient experience. The growth of research publications demonstrates an academic interest in these technologies, but there is little evidence these approaches are being employed in clinical settings. Future research is required to assess how NLP may capture the nuance of healthcare interactions, align with existing quality metrics and how it may be used to influence clinician behaviour


 Citation

Please cite as:

Feizollah A, Lin CY, O'malley L, Thompson W, Listl S, Byrne M

The Use of Natural Language Processing to Interpret Unstructured Patient Feedback on Health Services: Scoping Review

J Med Internet Res 2025;27:e72853

DOI: 10.2196/72853

PMID: 40811803

PMCID: 12352804

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.