Previously submitted to: JMIR Dermatology (no longer under consideration since Oct 21, 2024)

Date Submitted: Jul 20, 2023
Open Peer Review Period: Jul 20, 2023 - Sep 14, 2023

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Sentiment analysis and natural language processing using Reddit data to evaluate patient opinions on hair loss therapeutics

  • Rachel Sally; 
  • Jerry Shapiro; 
  • Kristen Lo Sicco

ABSTRACT

Background:

Online forums are rich sources of user-generated data, and analyzing this information with natural language processing techniques can provide insight into patient experiences. Reddit's topic-specific forums (subreddits) may allow for focused explorations, such as opinions on specific therapeutic agents.

Objective:

To determine patient sentiment about key treatments for female hair loss. Secondarily, to demonstrate the feasibility of using Reddit data to perform sentiment analysis on patient comments.

Methods:

A software pipeline scraped publicly available Reddit comments from r/femalehairloss and processed them into sentence tokens. Sentiment analysis was then performed on the tokenized comments, and a word-frequency representation was created.
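
The processing steps described above can be sketched roughly as follows. This is a minimal illustration only, not the authors' pipeline: the abstract does not name the scraper, tokenizer, or sentiment model used, so the regex sentence splitter, the tiny `POSITIVE`/`NEGATIVE` lexicon, the `STOPWORDS` set, and all function names below are hypothetical stand-ins. Scraping is omitted; the sketch assumes comments are already available as plain strings.

```python
import re
from collections import Counter

# Hypothetical toy lexicon and stopword list (assumptions for illustration;
# the abstract does not say which sentiment model or word lists were used).
POSITIVE = {"great", "helped", "better", "love", "regrowth"}
NEGATIVE = {"worse", "shedding", "bad", "hate", "awful"}
STOPWORDS = {"the", "a", "and", "i", "it", "my", "is", "was", "to", "of", "but", "me"}


def split_sentences(comment):
    """Split a comment into sentence tokens at '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", comment.strip())
    return [p for p in parts if p]


def words(text):
    """Lowercase a string and return its alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def sentence_score(sentence):
    """Return (positive hits - negative hits) / word count, a value in [-1, 1]."""
    tokens = words(sentence)
    if not tokens:
        return 0.0
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    return (pos - neg) / len(tokens)


def comment_score(comment):
    """Average the sentence-level scores of one comment (0.0 if it has no sentences)."""
    sentences = split_sentences(comment)
    if not sentences:
        return 0.0
    return sum(sentence_score(s) for s in sentences) / len(sentences)


def word_frequencies(comments):
    """Count non-stopword tokens across all comments."""
    counts = Counter()
    for comment in comments:
        counts.update(w for w in words(comment) if w not in STOPWORDS)
    return counts
```

A score above 0 in this sketch means positive words outnumber negative ones; a production analysis would more plausibly rely on an established sentiment model than on a hand-written lexicon.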

Results:

The most frequently cited single treatments were minoxidil and spironolactone. Comments referencing dutasteride were the most positive on average, although this may be skewed by the low number of dutasteride-only comments; comments mentioning PRP and minoxidil were the second and third most positive, respectively. Finasteride comments were the least positive on average, but their mean sentiment was still slightly greater than 0.
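
One way such per-treatment averages might be computed is sketched below. It assumes each comment already has a numeric sentiment score and groups only comments that mention exactly one treatment, which is one reading of "dutasteride-only comments"; the abstract does not state the exact grouping rule. The `TREATMENTS` tuple and the function names are hypothetical. Returning the comment count next to each mean makes a small-sample caveat like the dutasteride one visible.

```python
import re
from collections import defaultdict

# Treatment names taken from the abstract; matching on whole lowercase words
# is an assumption made for this sketch.
TREATMENTS = ("minoxidil", "spironolactone", "finasteride", "dutasteride", "prp")


def treatments_mentioned(comment):
    """Return the treatments that appear as whole words in a comment."""
    tokens = set(re.findall(r"[a-z]+", comment.lower()))
    return [t for t in TREATMENTS if t in tokens]


def mean_sentiment_by_treatment(scored_comments):
    """Map each treatment to (mean score, comment count).

    scored_comments is an iterable of (comment_text, score) pairs. Only
    comments mentioning exactly one treatment are counted.
    """
    buckets = defaultdict(list)
    for text, score in scored_comments:
        mentioned = treatments_mentioned(text)
        if len(mentioned) == 1:
            buckets[mentioned[0]].append(score)
    return {t: (sum(s) / len(s), len(s)) for t, s in buckets.items()}
```

Sorting the resulting dictionary by mean gives a ranking of treatments, and a low count for any entry signals that its mean should be read with caution.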

Conclusions:

In this paper, we have demonstrated the feasibility of performing sentiment analysis on Reddit comments. Our results suggest that opinions about hair loss therapeutics on the examined forum were on average positive. Analysis of health-focused subreddits such as r/femalehairloss can provide a deeper understanding of patient discourse and may also represent an opportunity for physicians to disseminate evidence-based recommendations.


 Citation

Please cite as:

Sally R, Shapiro J, Lo Sicco K

Sentiment analysis and natural language processing using Reddit data to evaluate patient opinions on hair loss therapeutics

JMIR Preprints. 20/07/2023:50918

DOI: 10.2196/preprints.50918

URL: https://preprints.jmir.org/preprint/50918


© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.