Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jan 30, 2021
Date Accepted: Oct 5, 2021

The final, peer-reviewed published version of this preprint can be found here:

Identifying Insomnia From Social Media Posts: Psycholinguistic Analyses of User Tweets

Sakib AS, Mukta DMSH, Huda FR, Islam AN, Islam T, Ali ME

Identifying Insomnia From Social Media Posts: Psycholinguistic Analyses of User Tweets

J Med Internet Res 2021;23(12):e27613

DOI: 10.2196/27613

PMID: 34889758

PMCID: 8704110

Identifying Insomnia from Social Media Posts: Psycholinguistic Analyses of User Tweets

  • Ahmed Shahriar Sakib; 
  • Dr. Md. Saddam Hossain Mukta; 
  • Fariha Rowshan Huda; 
  • A.K.M. Najmul Islam; 
  • Tohedul Islam; 
  • Mohammed Eunus Ali

ABSTRACT

Background:

Many people suffer from insomnia, which is a sleep disorder characterized by difficulty in falling and staying asleep during nighttime. Social media platforms enable users to share their thoughts, opinions, activities, and preferences with their friends, family, and acquaintances. Many users share their Insomnia issues on these platforms as well these social media content can be used to diagnose different mental health problems. However, a few studies predict insomnia from Twitter, but we find still missing research where no researcher conducts strong semantic analysis (i.e., word embedding) in their word use pattern of tweets. We also find gap that no studies show correlation between users’ insomnia and their Big5 personality traits from social media interaction.

Objective:

The purpose of this study is to build an insomnia prediction model from users’ psycholinguistic, i.e., word use, pattern and their Big5 personality traits derived from tweets.

Methods:

In this paper, we exploit both psycholinguistic and personality traits derived from tweets to identify insomnia patients. As a first step of the process, we build users’ psycholinguistic profile from word choices and semantic relationship among the words of 1,755 users’ tweets. Then, we find the relationship between users’ personality traits, and insomnia. Finally, we build a two-step double weighted ensemble classification model to predict insomnia from both psycholinguistic and personality traits derived from user tweets.

Results:

Our classification model shows strong prediction potential (75%) to predict insomnia. Insomniac people are generally ill tempered, and irritable. They feel more stressed, angry, sad, and mentally exhausted. They tend to use negate category of words, i.e., no, not, never. These people frequently use swear category of words, i.e., damn, piss, fuck, etc. with strong temperament. They also use anxious (i.e., worried, fearful, nervous, etc.) and sad (crying, grief, sad, etc.) categories of words in their tweets. We also find that the users with high Neuroticism score likely to have strong correlation with insomnia. They tend to undergo through depression, social introversion, repression, and intolerance. We also observe that users with high conscientiousness scores have strong correlation with insomnia pattern. In our study, we find negative correlation between Extraversion Big5 trait and insomnia.

Conclusions:

In this study, we have investigated users’ psycholinguistic and personality association with insomnia. We have also found out semantic relationship among words of users’ tweets by using word embedding technique. Then, we have built a rigorous ensemble model by using the three different models. Our ensemble classifier have shown strong prediction potential (AUC-78%). We have built the classifier by using a novel 2-step weighted ensemble technique that outperforms the independent classifiers. We plan to improve our classifier by integrating more data on social network such as friend list, time of the tweets, gender, work place, time spent on activities, etc. We also plan to analyze tweets with different languages.


 Citation

Please cite as:

Sakib AS, Mukta DMSH, Huda FR, Islam AN, Islam T, Ali ME

Identifying Insomnia From Social Media Posts: Psycholinguistic Analyses of User Tweets

J Med Internet Res 2021;23(12):e27613

DOI: 10.2196/27613

PMID: 34889758

PMCID: 8704110

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.