TY - JOUR
T1 - Identifying Insomnia from Social Media Posts
T2 - Psycholinguistic Analyses of User Tweets
AU - Sakib, Ahmed Shahriar
AU - Mukta, Md Saddam Hossain
AU - Huda, Fariha Rowshan
AU - Najmul Islam, A. K.M.
AU - Islam, Tohedul
AU - Ali, Mohammed Eunus
N1 - Publisher Copyright:
© 2021 Journal of Medical Internet Research. All rights reserved.
PY - 2021/12
Y1 - 2021/12
N2 - Background: Many people suffer from insomnia, a sleep disorder characterized by difficulty falling and staying asleep during the night. As social media have become a ubiquitous platform to share users' thoughts, opinions, activities, and preferences with their friends and acquaintances, the shared content across these platforms can be used to diagnose different health problems, including insomnia. Only a few recent studies have examined the prediction of insomnia from Twitter data, and we found research gaps in predicting insomnia from word usage patterns and correlations between users' insomnia and their Big 5 personality traits as derived from social media interactions. Objective: The purpose of this study is to build an insomnia prediction model from users' psycholinguistic patterns, including the elements of word usage, semantics, and their Big 5 personality traits as derived from tweets. Methods: In this paper, we exploited both psycholinguistic and personality traits derived from tweets to identify insomnia patients. First, we built psycholinguistic profiles of the users from their word choices and the semantic relationships between the words of their tweets. We then determined the relationship between a users' personality traits and insomnia. Finally, we built a double-weighted ensemble classification model to predict insomnia from both psycholinguistic and personality traits as derived from user tweets. Results: Our classification model showed strong prediction potential (78.8%) to predict insomnia from tweets. As insomniacs are generally ill-tempered and feel more stress and mental exhaustion, we observed significant correlations of certain word usage patterns among them. They tend to use negative words (eg, "no," "not," "never"). Some people frequently use swear words (eg, "damn," "piss," "fuck") with strong temperament. They also use anxious (eg, "worried," "fearful," "nervous") and sad (eg, "crying," "grief," "sad") words in their tweets. We also found that the users with high neuroticism and conscientiousness scores for the Big 5 personality traits likely have strong correlations with insomnia. Additionally, we observed that users with high conscientiousness scores have strong correlations with insomnia patterns, while negative correlation between extraversion and insomnia was also found. Conclusions: Our model can help predict insomnia from users' social media interactions. Thus, incorporating our model into a software system can help family members detect insomnia problems in individuals before they become worse. The software system can also help doctors to diagnose possible insomnia in patients.
AB - Background: Many people suffer from insomnia, a sleep disorder characterized by difficulty falling and staying asleep during the night. As social media have become a ubiquitous platform to share users' thoughts, opinions, activities, and preferences with their friends and acquaintances, the shared content across these platforms can be used to diagnose different health problems, including insomnia. Only a few recent studies have examined the prediction of insomnia from Twitter data, and we found research gaps in predicting insomnia from word usage patterns and correlations between users' insomnia and their Big 5 personality traits as derived from social media interactions. Objective: The purpose of this study is to build an insomnia prediction model from users' psycholinguistic patterns, including the elements of word usage, semantics, and their Big 5 personality traits as derived from tweets. Methods: In this paper, we exploited both psycholinguistic and personality traits derived from tweets to identify insomnia patients. First, we built psycholinguistic profiles of the users from their word choices and the semantic relationships between the words of their tweets. We then determined the relationship between a users' personality traits and insomnia. Finally, we built a double-weighted ensemble classification model to predict insomnia from both psycholinguistic and personality traits as derived from user tweets. Results: Our classification model showed strong prediction potential (78.8%) to predict insomnia from tweets. As insomniacs are generally ill-tempered and feel more stress and mental exhaustion, we observed significant correlations of certain word usage patterns among them. They tend to use negative words (eg, "no," "not," "never"). Some people frequently use swear words (eg, "damn," "piss," "fuck") with strong temperament. They also use anxious (eg, "worried," "fearful," "nervous") and sad (eg, "crying," "grief," "sad") words in their tweets. We also found that the users with high neuroticism and conscientiousness scores for the Big 5 personality traits likely have strong correlations with insomnia. Additionally, we observed that users with high conscientiousness scores have strong correlations with insomnia patterns, while negative correlation between extraversion and insomnia was also found. Conclusions: Our model can help predict insomnia from users' social media interactions. Thus, incorporating our model into a software system can help family members detect insomnia problems in individuals before they become worse. The software system can also help doctors to diagnose possible insomnia in patients.
KW - Big 5 personality traits
KW - Classification
KW - Insomnia
KW - Prediction model
KW - Psycholinguistics
KW - Social media
KW - Twitter
KW - Word embedding
UR - https://www.scopus.com/pages/publications/85121987992
U2 - 10.2196/27613
DO - 10.2196/27613
M3 - Article
C2 - 34889758
AN - SCOPUS:85121987992
SN - 1438-8871
VL - 23
JO - Journal of Medical Internet Research
JF - Journal of Medical Internet Research
IS - 12
M1 - e27613
ER -