Chinese MentalBERT: Domain-Adaptive Pre-training on Social Media for Chinese Mental Health Text Analysis
Wei Zhai, Hongzhi Qi, Qing Zhao, Jianqiang Li, Ziqi Wang, Han Wang,, Bing Xiang Yang, Guanghui Fu

TL;DR
Chinese MentalBERT is a domain-adapted pre-trained language model specifically designed for Chinese social media mental health text analysis, leveraging a large psychological dataset and lexicons to improve understanding of psychological expressions.
Contribution
The paper introduces Chinese MentalBERT, a novel domain-adaptive pre-training approach for Chinese mental health text analysis using social media data and psychological lexicons.
Findings
Improved performance on six public datasets compared to eight models.
Effective in providing psychologically relevant predictions.
Utilizes a large, specialized dataset for domain adaptation.
Abstract
In the current environment, psychological issues are prevalent and widespread, with social media serving as a key outlet for individuals to share their feelings. This results in the generation of vast quantities of data daily, where negative emotions have the potential to precipitate crisis situations. There is a recognized need for models capable of efficient analysis. While pre-trained language models have demonstrated their effectiveness broadly, there's a noticeable gap in pre-trained models tailored for specialized domains like psychology. To address this, we have collected a huge dataset from Chinese social media platforms and enriched it with publicly available datasets to create a comprehensive database encompassing 3.36 million text entries. To enhance the model's applicability to psychological text analysis, we integrated psychological lexicons into the pre-training masking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing
