From Reddit to Generative AI: Evaluating Large Language Models for Anxiety Support Fine-tuned on Social Media Data
Ugur Kursuncu, Trilok Padhi, Gaurav Sinha, Abdulkadir Erol, Jaya Krishna Mandivarapu, Christopher R. Larrison

TL;DR
This paper systematically evaluates large language models like GPT and Llama for anxiety support, revealing that fine-tuning improves language but can increase toxicity and reduce emotional supportiveness, highlighting associated risks.
Contribution
It introduces a mixed-method evaluation framework for assessing LLMs in sensitive mental health domains and provides insights into the effects of social media data fine-tuning.
Findings
Fine-tuning improves linguistic quality but increases toxicity.
GPT is more supportive than Llama in evaluations.
Fine-tuning on social media data can reduce emotional responsiveness.
Abstract
The growing demand for accessible mental health support, compounded by workforce shortages and logistical barriers, has led to increased interest in utilizing Large Language Models (LLMs) for scalable and real-time assistance. However, their use in sensitive domains such as anxiety support remains underexamined. This study presents a systematic evaluation of LLMs (GPT and Llama) for their potential utility in anxiety support by using real user-generated posts from the r/Anxiety subreddit for both prompting and fine-tuning. Our approach utilizes a mixed-method evaluation framework incorporating three main categories of criteria: (i) linguistic quality, (ii) safety and trustworthiness, and (iii) supportiveness. Results show that fine-tuning LLMs with naturalistic anxiety-related data enhanced linguistic quality but increased toxicity and bias, and diminished emotional responsiveness.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Mental Health Research Topics · Digital Mental Health Interventions
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Discriminative Fine-Tuning · Dense Connections · Linear Warmup With Cosine Annealing
