EmPO: Emotion Grounding for Empathetic Response Generation through Preference Optimization
Ondrej Sotolar, Vojtech Formanek, Alok Debnath, Allison Lahnala,, Charles Welch, Lucie FLek

TL;DR
This paper introduces EmPO, a method that uses emotion-grounded preference datasets and optimization to improve empathetic response generation in large language models, maintaining their general performance.
Contribution
It presents a novel emotion grounding approach for creating preference datasets and aligning LLMs for empathetic responses without sacrificing generalization.
Findings
LLMs can be aligned for empathy via preference optimization.
Emotion grounding guides effective preference dataset creation.
Models retain general performance after alignment.
Abstract
Empathetic response generation is a desirable aspect of conversational agents, crucial for facilitating engaging and emotionally intelligent multi-turn conversations between humans and machines. Leveraging large language models for this task has shown promising results, yet challenges persist in ensuring both the empathetic quality of the responses and retention of the generalization performance of the models. We propose a novel approach where we construct theory-driven preference datasets based on emotion grounding and use them to align LLMs with preference optimization algorithms to address these challenges. To evaluate empathetic response generation, we employ the EmpatheticDialogues dataset, assessing empathy with the diff-Epitome and BERTscore metrics and with multi-dimensional human evaluation. Additionally, we measure diversity and emotional valence using feature-based methods.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices
MethodsALIGN
