Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Feb 26, 2019
Date Accepted: Jul 7, 2019
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

The #MeToo Movement in the United States: Text Analysis of Early Twitter Conversations

Modrek S, Chakalov B

The #MeToo Movement in the United States: Text Analysis of Early Twitter Conversations

J Med Internet Res 2019;21(9):e13837

DOI: 10.2196/13837

PMID: 31482849

PMCID: 6751092

The Spark of the #MeTooMovement: Text Analysis of the Early Twitter Conversation

  • Sepideh Modrek; 
  • Bozhidar Chakalov

ABSTRACT

Background:

The #MeToo movement sparked a national debate on the sexual harassment, abuse and assault and has taken many directions since its inception in October of 2017. Much of the early conversation took place on public social media sites such as Twitter where the hashtag movement began.

Objective:

The intent of this study is to document, characterize and quantify the early public discourse and conversation of the #MeToo movement from Twitter data. We focus on posts with public first-person revelations of sexual assault/abuse and early life experiences of such events.

Methods:

We purchased full tweets and associated metadata from the Twitter Premium API between October 14th –October 21st, 2017; the first week of the movement. We examined the content of novel English language tweets with the phrase ‘MeToo’ from within the United States (N=11,935). We used machine learning methods, Least Absolute Shrinkage and Selection Operator regressions and Support Vector Machine models, to summarize and classify the content of individual tweets with revelations of sexual assault and abuse and early life experiences of sexual assault and abuse.

Results:

We show that the most predictive words create a vivid archetype of the revelations sexual assault and abuse. We then estimate that in the first week of the movement, 11% of novel English language tweets with the words ‘MeToo’ revealed details about the poster’s experience of sexual assault or abuse and 5.8% revealed early life experiences of such events. We examine the demographic composition of posters of sexual assault and abuse and find that white women aged 25-50 were overrepresented relative to their representation on Twitter and national estimates in posting about their experiences. Furthermore, we find that the mass sharing of personal experiences of sexual assault and abuse had a large reach where 6 to 34 million Twitter users may have seen such first-person revelation from someone they follow in this first week of the movement.

Conclusions:

These data illustrate that revelations shared went beyond acknowledgement of having experienced sexual harassment and often included vivid and traumatic descriptions of early life experiences of assault and abuse. These finding and methods underscore the value of content analysis, supported by novel machine learning methods, to improve our understanding of how widespread the revelations were which likely amplified the spread and saliency of the #MeToo movement.


 Citation

Please cite as:

Modrek S, Chakalov B

The #MeToo Movement in the United States: Text Analysis of Early Twitter Conversations

J Med Internet Res 2019;21(9):e13837

DOI: 10.2196/13837

PMID: 31482849

PMCID: 6751092

Per the author's request the PDF is not available.