Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Jun 21, 2025
Open Peer Review Period: Jun 23, 2025 - Aug 18, 2025
Date Accepted: Oct 16, 2025
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Intersection of Big Five Personality Traits and Substance Use on X: Insight from the COVID-19 Pandemic
ABSTRACT
Background:
Personality traits are known predictors of substance use (SU), but their expression and association with SU during a global crisis remain largely unexamined at a population scale. The COVID-19 pandemic, which amplified both SU rates and online social engagement, created a unique natural experiment to investigate these dynamics through digital discourse. This approach offers insights beyond traditional self-report methods, which is crucial for developing timely and targeted public health interventions.
Objective:
To evaluate whether the associations between the Big Five personality traits and SU discourse shifted during the 2019–2021 period, and to conduct a focused analysis of how these traits predict SU and relate to specific substance types, emotional expression, and demographic factors.
Methods:
We analyzed a corpus of several hundred million public posts from a major social media platform from 2019 to 2021. Using a pipeline of natural language processing and deep learning models, we identified SU-related posts and subsequently extracted scores for the Big Five personality traits, emotions, and user demographics. We employed trend analysis to compare annual shifts in trait-SU associations, while detailed 2020 data underwent rigorous modeling using logistic regression, correlation analysis, and topic modeling to elucidate the core relationships.
Results:
Our analysis revealed that Extraversion (OR=3.22) and, most strikingly, Agreeableness (OR=4.04) were the strongest positive predictors of being a substance user. In stark contrast to the conventional self-medication hypothesis, Neuroticism emerged as a robust protective factor against SU (OR=0.29). This counterintuitive finding was supported by a decreased association between Neuroticism and SU posts at the pandemic's onset in 2020 (d=−0.13) and a negative correlation with the expression of negative emotions online. Topic modeling further indicated that SU discourse was frequently embedded in social contexts (Social Drinking, Friendly Beverage Choices) rather than themes of solitary coping.
Conclusions:
Our findings challenge traditional models by demonstrating that in large-scale online discourse, SU expression is more powerfully linked to social-affiliative traits than to negative emotionality. The paradoxical protective role of Neuroticism suggests that established risk profiles may not apply uniformly to digital environments, particularly during a public health crisis. These insights are vital for refining computational methods for public health surveillance and developing interventions that recognize the potent social drivers of substance use in the digital age.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.