From Reddit to Generative AI: Evaluating Large Language Models for Anxiety Support Fine-tuned on Social Media Data

Ugur Kursuncu; Trilok Padhi; Gaurav Sinha; Abdulkadir Erol; Jaya Krishna Mandivarapu; Christopher R. Larrison

arXiv:2505.18464·cs.HC·May 27, 2025

From Reddit to Generative AI: Evaluating Large Language Models for Anxiety Support Fine-tuned on Social Media Data

Ugur Kursuncu, Trilok Padhi, Gaurav Sinha, Abdulkadir Erol, Jaya Krishna Mandivarapu, Christopher R. Larrison

PDF

Open Access

TL;DR

This paper systematically evaluates large language models like GPT and Llama for anxiety support, revealing that fine-tuning improves language but can increase toxicity and reduce emotional supportiveness, highlighting associated risks.

Contribution

It introduces a mixed-method evaluation framework for assessing LLMs in sensitive mental health domains and provides insights into the effects of social media data fine-tuning.

Findings

01

Fine-tuning improves linguistic quality but increases toxicity.

02

GPT is more supportive than Llama in evaluations.

03

Fine-tuning on social media data can reduce emotional responsiveness.

Abstract

The growing demand for accessible mental health support, compounded by workforce shortages and logistical barriers, has led to increased interest in utilizing Large Language Models (LLMs) for scalable and real-time assistance. However, their use in sensitive domains such as anxiety support remains underexamined. This study presents a systematic evaluation of LLMs (GPT and Llama) for their potential utility in anxiety support by using real user-generated posts from the r/Anxiety subreddit for both prompting and fine-tuning. Our approach utilizes a mixed-method evaluation framework incorporating three main categories of criteria: (i) linguistic quality, (ii) safety and trustworthiness, and (iii) supportiveness. Results show that fine-tuning LLMs with naturalistic anxiety-related data enhanced linguistic quality but increased toxicity and bias, and diminished emotional responsiveness.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Mental Health Research Topics · Digital Mental Health Interventions

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Discriminative Fine-Tuning · Dense Connections · Linear Warmup With Cosine Annealing