DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization

Chao Zhang; Xin Shi; Xueqiao Zhang; Yifan Zhu; Yi Yang; Yawei Luo

arXiv:2505.16995·cs.CL·January 19, 2026

DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization

Chao Zhang, Xin Shi, Xueqiao Zhang, Yifan Zhu, Yi Yang, Yawei Luo

PDF

Open Access

TL;DR

This paper introduces DecoupledESC, a framework that improves emotional support conversation generation by separating strategy planning and response generation, utilizing a new dataset and preference optimization to reduce errors and enhance response quality.

Contribution

It proposes a decoupled approach for ESC that leverages a new dataset and preference optimization, addressing data entanglement and ambiguity issues in existing methods.

Findings

01

Outperforms joint optimization baselines in experiments.

02

Reduces psychological errors in generated responses.

03

Enhances response quality through decoupled training.

Abstract

Recent advances in Emotional Support Conversation (ESC) have improved emotional support generation by fine-tuning Large Language Models (LLMs) via Supervised Fine-Tuning (SFT). However, common psychological errors still persist. While Direct Preference Optimization (DPO) shows promise in reducing such errors through pairwise preference learning, its effectiveness in ESC tasks is limited by two key challenges: (1) Entangled data structure: Existing ESC data inherently entangles psychological strategies and response content, making it difficult to construct high-quality preference pairs; and (2) Optimization ambiguity: Applying vanilla DPO to such entangled pairwise data leads to ambiguous training objectives. To address these issues, we introduce Inferential Preference Mining (IPM) to construct high-quality preference data, forming the IPM-PrefDial dataset. Building upon this data, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Mental Health via Writing · Emotion and Mood Recognition

MethodsShrink and Fine-Tune · Direct Preference Optimization · ALIGN