Preference Learning Unlocks LLMs' Psycho-Counseling Skills

Mian Zhang; Shaun M. Eack; Zhiyu Zoey Chen

arXiv:2502.19731·cs.CL·April 14, 2026

Preference Learning Unlocks LLMs' Psycho-Counseling Skills

Mian Zhang, Shaun M. Eack, Zhiyu Zoey Chen

PDF

1 Repo 2 Models 1 Datasets

TL;DR

This paper introduces a new preference dataset and reward model to enhance large language models' psycho-counseling abilities, achieving high performance aligned with professional standards.

Contribution

It creates PsychoCounsel-Preference, a large preference dataset, and develops PsychoCounsel-Llama3-8B, a model that significantly improves LLMs' psycho-counseling responses.

Findings

01

PsychoCounsel-Preference contains 36k high-quality preference pairs.

02

PsychoCounsel-Llama3-8B achieves an 87% win rate against GPT-4o.

03

The models and dataset are publicly released for further research.

Abstract

Applying large language models (LLMs) to assist in psycho-counseling is an emerging and meaningful approach, driven by the significant gap between patient needs and the availability of mental health support. However, current LLMs struggle to consistently provide effective responses to client speeches, largely due to the lack of supervision from high-quality real psycho-counseling data, whose content is typically inaccessible due to client privacy concerns. Furthermore, the quality of therapists' responses in available sessions can vary significantly based on their professional training and experience. Assessing the quality of therapists' responses remains an open challenge. In this work, we address these challenges by first proposing a set of professional and comprehensive principles to evaluate therapists' responses to client speeches. Using these principles, we create a preference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://hf.co/Psychotherapy-LLM
github

Models

Datasets

Psychotherapy-LLM/PsyCoPref
dataset· 301 dl
301 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.