Enhancing Depressive Post Detection in Bangla: A Comparative Study of TF-IDF, BERT and FastText Embeddings
Saad Ahmed Sazan, Mahdi H. Miraz, A B M Muntasir Rahman

TL;DR
This paper compares TF-IDF, BERT, and FastText embeddings within a CNN-BiLSTM model to detect depressive posts in Bangla social media data, demonstrating BERT's superior performance with an 84% F1-score.
Contribution
It introduces a novel approach combining advanced NLP techniques and deep learning for depressive post detection in Bangla, with a comprehensive comparison of embedding methods.
Findings
BERT embedding outperforms TF-IDF and FastText in detection accuracy.
Oversampling improves model performance on imbalanced data.
The proposed method achieves an 84% F1-score, surpassing existing techniques.
Abstract
Due to massive adoption of social media, detection of users' depression through social media analytics bears significant importance, particularly for underrepresented languages, such as Bangla. This study introduces a well-grounded approach to identify depressive social media posts in Bangla, by employing advanced natural language processing techniques. The dataset used in this work, annotated by domain experts, includes both depressive and non-depressive posts, ensuring high-quality data for model training and evaluation. To address the prevalent issue of class imbalance, we utilised random oversampling for the minority class, thereby enhancing the model's ability to accurately detect depressive posts. We explored various numerical representation techniques, including Term Frequency-Inverse Document Frequency (TF-IDF), Bidirectional Encoder Representations from Transformers (BERT)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics · Mental Health via Writing
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Residual Connection · Layer Normalization · fastText · Linear Layer · Attention Dropout · Linear Warmup With Linear Decay · Adam · Dropout
