Distilling Empathy from Large Language Models

Henry J. Xie; Jinghan Zhang; Xinhao Zhang; and Kunpeng Liu

arXiv:2507.08151·cs.CL·July 14, 2025

Distilling Empathy from Large Language Models

Henry J. Xie, Jinghan Zhang, Xinhao Zhang, and Kunpeng Liu

PDF

TL;DR

This paper presents a method for distilling empathy from large language models into smaller models, using a two-step fine-tuning process and targeted prompts, significantly improving empathetic response generation.

Contribution

It introduces a comprehensive empathy distillation approach with novel prompts and a two-step fine-tuning process to enhance empathy in smaller language models.

Findings

01

SLMs with empathy distillation outperform baseline models in empathetic response generation.

02

Targeted prompts improve empathy transfer, achieving a 10% higher win rate.

03

Two-step fine-tuning significantly enhances the empathetic capabilities of SLMs.

Abstract

The distillation of knowledge from Large Language Models (LLMs) into Smaller Language Models (SLMs), preserving the capabilities and performance of LLMs while reducing model size, has played a key role in the proliferation of LLMs. Because SLMs are considerably smaller than LLMs, they are often utilized in domains where human interaction is frequent but resources are highly constrained, e.g., smart phones. Therefore, it is crucial to ensure that empathy, a fundamental aspect of positive human interactions, already instilled into LLMs, is retained by SLMs after distillation. In this paper, we develop a comprehensive approach for effective empathy distillation from LLMs into SLMs. Our approach features a two-step fine-tuning process that fully leverages datasets of empathetic dialogue responses distilled from LLMs. We explore several distillation methods beyond basic direct prompting and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.