Distilling Empathy from Large Language Models
Henry J. Xie, Jinghan Zhang, Xinhao Zhang, and Kunpeng Liu

TL;DR
This paper presents a method for distilling empathy from large language models into smaller models, using a two-step fine-tuning process and targeted prompts, significantly improving empathetic response generation.
Contribution
It introduces a comprehensive empathy distillation approach with novel prompts and a two-step fine-tuning process to enhance empathy in smaller language models.
Findings
SLMs with empathy distillation outperform baseline models in empathetic response generation.
Targeted prompts improve empathy transfer, achieving a 10% higher win rate.
Two-step fine-tuning significantly enhances the empathetic capabilities of SLMs.
Abstract
The distillation of knowledge from Large Language Models (LLMs) into Smaller Language Models (SLMs), preserving the capabilities and performance of LLMs while reducing model size, has played a key role in the proliferation of LLMs. Because SLMs are considerably smaller than LLMs, they are often utilized in domains where human interaction is frequent but resources are highly constrained, e.g., smart phones. Therefore, it is crucial to ensure that empathy, a fundamental aspect of positive human interactions, already instilled into LLMs, is retained by SLMs after distillation. In this paper, we develop a comprehensive approach for effective empathy distillation from LLMs into SLMs. Our approach features a two-step fine-tuning process that fully leverages datasets of empathetic dialogue responses distilled from LLMs. We explore several distillation methods beyond basic direct prompting and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
