Beyond Architectures: Evaluating the Role of Contextual Embeddings in Detecting Bipolar Disorder on Social Media
Khalid Hasan, Jamil Saquer

TL;DR
This study evaluates various NLP models, especially transformer-based ones, for detecting bipolar disorder from social media posts, highlighting the importance of contextual embeddings for accurate mental health classification.
Contribution
It provides a comprehensive comparison of transformer and LSTM models using contextualized and static embeddings for bipolar disorder detection on social media.
Findings
RoBERTa achieves ~98% F1 score, outperforming other models.
LSTM with BERT embeddings performs nearly as well as transformer models.
Static embeddings with LSTM fail to capture meaningful patterns.
Abstract
Bipolar disorder is a chronic mental illness frequently underdiagnosed due to subtle early symptoms and social stigma. This paper explores the advanced natural language processing (NLP) models for recognizing signs of bipolar disorder based on user-generated social media text. We conduct a comprehensive evaluation of transformer-based models (BERT, RoBERTa, ALBERT, ELECTRA, DistilBERT) and Long Short Term Memory (LSTM) models based on contextualized (BERT) and static (GloVe, Word2Vec) word embeddings. Experiments were performed on a large, annotated dataset of Reddit posts after confirming their validity through sentiment variance and judgmental analysis. Our results demonstrate that RoBERTa achieves the highest performance among transformer models with an F1 score of ~98% while LSTM models using BERT embeddings yield nearly identical results. In contrast, LSTMs trained on static…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
