Are Large Language Models More Empathetic than Humans?
Anuradha Welivita, Pearl Pu

TL;DR
This study demonstrates that large language models, especially GPT-4, outperform humans in empathetic responses across diverse emotional prompts, challenging assumptions about human superiority in empathy.
Contribution
The paper introduces a comprehensive evaluation framework for assessing LLMs' empathy, showing their superior performance over humans in emotional response quality.
Findings
GPT-4 is approximately 31% more empathetic than humans.
LLaMA-2, Mixtral, and Gemini-Pro also outperform humans in empathy.
Different LLMs excel at responding to specific emotions.
Abstract
With the emergence of large language models (LLMs), investigating if they can surpass humans in areas such as emotion recognition and empathetic responding has become a focal point of research. This paper presents a comprehensive study exploring the empathetic responding capabilities of four state-of-the-art LLMs: GPT-4, LLaMA-2-70B-Chat, Gemini-1.0-Pro, and Mixtral-8x7B-Instruct in comparison to a human baseline. We engaged 1,000 participants in a between-subjects user study, assessing the empathetic quality of responses generated by humans and the four LLMs to 2,000 emotional dialogue prompts meticulously selected to cover a broad spectrum of 32 distinct positive and negative emotions. Our findings reveal a statistically significant superiority of the empathetic responding capability of LLMs over humans. GPT-4 emerged as the most empathetic, marking approximately 31% increase in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsAttention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer
