Quantitative Assessment of Intersectional Empathetic Bias and Understanding
Vojtech Formanek, Ondrej Sotolar

TL;DR
This paper introduces a new framework for evaluating empathy in language models by measuring response variance to socially biased prompts, aiming for more valid and cross-lingual assessments.
Contribution
It operationalizes empathy measurement aligned with psychological theories and controls prompt generation to improve validity and translation quality.
Findings
Models showed significant changes in reasoning with different prompts.
Small variance observed in initial tests, indicating subtle sensitivity.
Framework provides a basis for future empathy evaluation research.
Abstract
A growing amount of literature critiques the current operationalizations of empathy based on loose definitions of the construct. Such definitions negatively affect dataset quality, model robustness, and evaluation reliability. We propose an empathy evaluation framework that operationalizes empathy close to its psychological origins. The framework measures the variance in responses of LLMs to prompts using existing metrics for empathy and emotional valence. The variance is introduced through the controlled generation of the prompts by varying social biases affecting context understanding, thus impacting empathetic understanding. The control over generation ensures high theoretical validity of the constructs in the prompt dataset. Also, it makes high-quality translation, especially into languages that currently have little-to-no way of evaluating empathy or bias, such as the Slavonic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics in Business and Education · Conflict Management and Negotiation
