Labels Generated by Large Language Models Help Measure People's Empathy in Vitro

Md Rakibul Hasan; Yue Yao; Md Zakir Hossain; Aneesh Krishna; Imre Rudas; Shafin Rahman; Tom Gedeon

arXiv:2501.00691·cs.CL·March 17, 2026

Labels Generated by Large Language Models Help Measure People's Empathy in Vitro

Md Rakibul Hasan, Yue Yao, Md Zakir Hossain, Aneesh Krishna, Imre Rudas, Shafin Rahman, Tom Gedeon

PDF

1 Repo

TL;DR

This paper investigates how large language model-generated labels can enhance supervised training for empathy prediction tasks, leading to improved accuracy and more equitable models.

Contribution

It introduces methods for using LLM-generated labels to correct noise and augment data in empathy computing, achieving state-of-the-art results.

Findings

01

LLM-generated labels improve empathy prediction accuracy.

02

Replacing noisy labels with LLM labels yields significant gains.

03

The RoBERTa model trained with these labels achieves a Pearson correlation of 0.648.

Abstract

Large language models (LLMs) have revolutionised many fields, with LLM-as-a-service (LLMSaaS) offering accessible, general-purpose solutions without costly task-specific training. In contrast to the widely studied prompt engineering for directly solving tasks (in vivo), this paper explores LLMs' potential for in-vitro applications: using LLM-generated labels to improve supervised training of mainstream models. We examine two strategies - (1) noisy label correction and (2) training data augmentation - in empathy computing, an emerging task to predict psychology-based questionnaire outcomes from inputs like textual narratives. Crowdsourced datasets in this domain often suffer from noisy labels that misrepresent underlying empathy. We show that replacing or supplementing these crowdsourced labels with LLM-generated labels, developed using psychology-based scale-aware prompts, achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hasan-rakibul/llmpathy
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Layer Normalization · Dense Connections · Attention Dropout · WordPiece · Dropout · Linear Layer · Softmax · Linear Warmup With Linear Decay