Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jun 28, 2023
Date Accepted: Apr 29, 2024
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Classification of Patients’ Judgments of Their Physicians in Web-Based Written Reviews Using Natural Language Processing: Algorithm Development and Validation

Madanay F, Tu K, Campagna A, Davis JK, Doerstling SS, Chen F, Ubel PA

Classification of Patients’ Judgments of Their Physicians in Web-Based Written Reviews Using Natural Language Processing: Algorithm Development and Validation

J Med Internet Res 2024;26:e50236

DOI: 10.2196/50236

PMID: 39088259

PMCID: 11327625

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

What patients like and dislike about their physicians: Developing and testing an algorithm to classify social judgments in online physician reviews

  • Farrah Madanay; 
  • Karissa Tu; 
  • Ada Campagna; 
  • J. Kelly Davis; 
  • Steven S. Doerstling; 
  • Felicia Chen; 
  • Peter A. Ubel

ABSTRACT

Background:

Patients increasingly rely on online physician reviews to choose a physician and share their experiences. However, the unstructured text of these online physician reviews presents a challenge for researchers seeking to make inferences about patients’ judgments. Methods previously used to identify patient judgments within reviews, such as hand-coding and dictionary-based approaches, have posed limitations to sample size and classification accuracy. Advanced natural language processing methods can help overcome these limitations and promote further analysis of physician reviews on these popular platforms.

Objective:

We aimed to train, test, and validate an advanced natural language processing algorithm for classifying the presence and valence of two social judgments in online physician reviews: interpersonal manner and technical competence.

Methods:

We sampled 345,053 reviews for 167,150 physicians across the United States from Healthgrades.com, a commercial online physician rating and review website. We hand-coded 2,000 reviews and used those reviews to train and test a transformer classification algorithm called Robustly Optimized BERT Pre-Training Approach (RoBERTa). The two fine-tuned models coded the reviews for the presence and positive or negative valence of patients’ interpersonal manner or technical competence judgments of their physicians. We evaluated the performance of the two models against 200 hand-coded reviews and validated the models using the full sample of 345,053 RoBERTa-coded reviews.

Results:

The interpersonal manner model was 90% accurate with precision of 0.89, recall of 0.90, and weighted F1 score of 0.89. The technical competence model was 90% accurate with precision of 0.91, recall of 0.90, and weighted F1 score of 0.90. Positive-valence judgments were associated with higher review star ratings whereas negative-valence judgments were associated with lower star ratings. Analysis of the data by review rating and physician gender corresponded with findings in prior literature.

Conclusions:

Our two classification models coded patients' interpersonal manner and technical competence judgments in online physician reviews with high precision, recall, and accuracy. These models were validated using review star ratings and results from previous research. RoBERTa can accurately classify unstructured, online review text at scale. Future work could explore the use of this algorithm with other textual data, such as social media posts and electronic health records.


 Citation

Please cite as:

Madanay F, Tu K, Campagna A, Davis JK, Doerstling SS, Chen F, Ubel PA

Classification of Patients’ Judgments of Their Physicians in Web-Based Written Reviews Using Natural Language Processing: Algorithm Development and Validation

J Med Internet Res 2024;26:e50236

DOI: 10.2196/50236

PMID: 39088259

PMCID: 11327625

Per the author's request the PDF is not available.