TL;DR
This study evaluates various text representation methods and AutoML tools for classifying social media posts related to mental health risk, achieving state-of-the-art results using transfer learning with GPT-1.
Contribution
It demonstrates the effectiveness of transfer learning and pretrained language models, particularly GPT-1, for mental health risk classification on social media data.
Findings
GPT-1 features achieved the best classification performance.
The top system had a macro F1 score of 0.572.
Transfer learning improves risk prediction with limited labeled data.
Abstract
Mental illness affects a significant portion of the worldwide population. Online mental health forums can provide a supportive environment for those afflicted and also generate a large amount of data which can be mined to predict mental health states using machine learning methods. We benchmark multiple methods of text feature representation for social media posts and compare their downstream use with automated machine learning (AutoML) tools to triage content for moderator attention. We used 1588 labeled posts from the CLPsych 2017 shared task collected from the Reachout.com forum (Milne et al., 2019). Posts were represented using lexicon based tools including VADER, Empath, LIWC and also used pre-trained artificial neural network models including DeepMoji, Universal Sentence Encoder, and GPT-1. We used TPOT and auto-sklearn as AutoML tools to generate classifiers to triage the posts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
