TL;DR
This paper introduces methods for generating multi-strategy supportive utterances in emotional support conversations, demonstrating improved support quality through multi-strategy modeling and reinforcement learning.
Contribution
It formulates multi-strategy utterance generation for ESC, proposing All-in-One and One-by-One methods with reinforcement learning, and provides the first empirical evidence of their effectiveness.
Findings
Multi-strategy utterances improve supportive quality.
Proposed methods outperform single-strategy baselines.
Reinforcement learning enhances strategy selection and response quality.
Abstract
Emotional Support Conversation (ESC) aims to assist individuals experiencing distress by generating empathetic and supportive dialogue. While prior work typically assumes that each supporter turn corresponds to a single strategy, real-world supportive communication often involves multiple strategies within a single utterance. In this paper, we revisit the ESC task by formulating it as multi-strategy utterance generation, where each utterance may contain one or more strategy-response pairs. We propose two generation methods: All-in-One, which predicts all strategy-response pairs in a single decoding step, and One-by-One, which iteratively generates strategy-response pairs until completion. Both methods are further enhanced with cognitive reasoning guided by reinforcement learning to improve strategy selection and response composition. We evaluate our models on the ESConv dataset under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
