Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Feb 20, 2025
Date Accepted: Apr 17, 2025
The use of natural language processing to interpret unstructured patient feedback on health services: A scoping review
ABSTRACT
Background:
Unstructured patient feedback (UPF) allows patients to freely express their experiences without the constraints of predefined questions. The proliferation of online healthcare rating websites has created a vast source of UPF. Natural language processing (NLP) techniques, particularly sentiment analysis and topic modelling, are increasingly being used to analyse UPF in healthcare settings, however the scope and clinical relevance of these technologies is unclear.
Objective:
This scoping review investigates how NLP techniques are being used to interpret UPF, with focus on the healthcare settings in which this is used, the purposes for using these technologies, and any impacts reported on clinical practice.
Methods:
Searches of the MEDLINE, EMBASE, CINAHL, Cochrane Database of Reviews, and Google Scholar were conducted in February 2024. No date limits were applied. English language studies that used NLP techniques on UPF that pertained to an identifiable health care setting or provider were included. Data extraction focused on the healthcare setting, NLP methods used, and applications of these techniques.
Results:
52 studies were included. NLP was most commonly applied to UPF from secondary care settings (n=33) with fewer in primary (n=10) or community (n=5) care. Three NLP techniques were identified in the included studies: sentiment analysis (n=32), topic modelling (n=15) and text classification (n=7). Sentiment analysis was applied to explore associations between patient sentiment and healthcare provider characteristics, track emotional responses over time, and identify areas for improvement in healthcare delivery. Topic modelling, primarily using Latent Dirichlet Allocation (LDA) algorithm, was employed to uncover latent themes in patient feedback, compare patient experiences across different healthcare settings, and track changes in patient concerns over time. Text classification was used to categorize patient feedback into predefined topics. The association between NLP-derived insights and traditional healthcare quality metrics was limited, with few studies describing concrete clinical impacts resulting from their analyses.
Conclusions:
NLP has been applied to UPF across a number of contexts, primarily to identify features of health services or professionals that support good patient experience. The growth of research publications demonstrates an academic interest in these technologies, but there is little evidence these approaches are being employed in clinical settings. Future research is required to assess how NLP may capture the nuance of healthcare interactions, align with existing quality metrics and how it may be used to influence clinician behaviour
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.