Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition

Minxue Niu; Emily Mower Provost

arXiv:2505.18040·cs.CL·May 26, 2025

Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition

Minxue Niu, Emily Mower Provost

PDF

1 Repo

TL;DR

This paper introduces a contrastive distillation method that transfers emotion knowledge from large language models into a compact model, enabling zero-shot emotion recognition across diverse label sets without human annotations.

Contribution

It presents a novel framework that distills rich emotion understanding from LLMs into small models, enhancing zero-shot ER performance without relying on fixed label sets or human annotations.

Findings

01

Distilled model outperforms similar-sized baselines

02

Approaches GPT-4's zero-shot performance

03

Over 10,000 times smaller than LLMs

Abstract

The ability to handle various emotion labels without dedicated training is crucial for building adaptable Emotion Recognition (ER) systems. Conventional ER models rely on training using fixed label sets and struggle to generalize beyond them. On the other hand, Large Language Models (LLMs) have shown strong zero-shot ER performance across diverse label spaces, but their scale limits their use on edge devices. In this work, we propose a contrastive distillation framework that transfers rich emotional knowledge from LLMs into a compact model without the use of human annotations. We use GPT-4 to generate descriptive emotion annotations, offering rich supervision beyond fixed label sets. By aligning text samples with emotion descriptors in a shared embedding space, our method enables zero-shot prediction on different emotion classes, granularity, and label schema. The distilled model is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chailab-umich/uni_emotion_release
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Multi-Head Attention · Layer Normalization · Byte Pair Encoding