Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: May 29, 2020
Date Accepted: Mar 16, 2021
Determination of Patient Sentiment and Emotion in Ophthalmology: Infoveillance Study on Online Health Forum Discussions
ABSTRACT
Background:
Clinical data present in social media is an underused source of information with great potential to allow for a deeper understanding of patient values, attitudes and preferences.
Objective:
We describe a novel, robust and modular method for sentiment analysis and emotion detection to free text from online medical health forums and the factors to consider during its application.
Methods:
We mined the full discussion and user information of all posts containing search terms related to a specific medical subspecialty (oculoplastics) from MedHelp, the largest online platform for patient health forums. We employed a variety of data cleaning and processing to define the relevant subset of results and prepare those results for sentiment analysis. We executed sentiment and emotion analysis through IBM Watson Natural Language Understanding service to generate sentiment and emotion scores for the posts and their associated keywords. Keywords were aggregated using natural language processing tools.
Results:
39 oculoplastics-related search terms resulted in 46,381 eligible posts within 14,329 threads, written by 18,319 users (117 doctors; 18,202 patients) and 201,611 associated keywords. Keywords that occurred ≥500 times in the corpus were used to identify most prominent topics, including specific symptoms, medication and complications. The sentiment and emotion scores of these keywords and eligible posts were further analyzed to provide concrete examples of the methodology’s potential to allow better understanding of patients’ attitudes. The overall sentiment score reflects a positive, neutral or negative sentiment, while the emotions (anger, disgust, fear, joy, sadness) scores represents the likelihood presence of the emotion. In major keyword groupings’ analyses, medical signs, symptoms and diseases had the lowest overall sentiment score (-0.598). Complications were highly associated with sadness (0.485). Forum posts mentioning body parts were related to sadness (0.416), but also fear (0.321). Administration was the category with the highest anger score (0.146). The top six forum subgroups had an overall negative sentiment score; the most negative one being the Neurology forum with a score of -0.438. The Undiagnosed Symptoms forum had the highest sadness score (0.448). The least likely fearful posts were the ones from the Eye Care forum, with a score of 0.260. Furthermore, the overall sentiment score is much more negative before the doctor replied. The anger, disgust, fear and sadness emotion scores decreased in likelihood, while joy was expressed slightly more likely after the doctor replied.
Conclusions:
This report allows physicians and researchers to efficiently mine and perform sentiment analysis on social media to better understand patients’ perspectives and promote patient-centric care. Important factors to be considered during application include evaluating the scope of the search, selecting search terms and understanding their different linguistic usages, and establishing selection, filtering and processing criteria for posts and keywords tailored to the desired results. Clinical Trial: This study did not require trial registration.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.