Distilling Fine-grained Sentiment Understanding from Large Language   Models

Yice Zhang; Guangyu Xie; Hongling Xu; Kaiheng Hou; Jianzhu Bao,; Qianlong Wang; Shiwei Chen; and Ruifeng Xu

arXiv:2412.18552·cs.CL·December 31, 2024

Distilling Fine-grained Sentiment Understanding from Large Language Models

Yice Zhang, Guangyu Xie, Hongling Xu, Kaiheng Hou, Jianzhu Bao,, Qianlong Wang, Shiwei Chen, and Ruifeng Xu

PDF

Open Access 1 Repo 2 Models 1 Datasets

TL;DR

This paper proposes a method to transfer fine-grained sentiment understanding from large language models to smaller models, significantly improving their performance and zero-shot capabilities for sentiment analysis tasks.

Contribution

It introduces a distillation approach from LLMs to SLMs for fine-grained sentiment analysis and develops a benchmark to evaluate both models.

Findings

01

SLMs achieve a 6% improvement in F1-score after distillation.

02

Distilled SLMs outperform Llama-2-7b with only 220M parameters.

03

SLMs gain strong zero-shot sentiment classification abilities.

Abstract

Fine-grained sentiment analysis (FSA) aims to extract and summarize user opinions from vast opinionated text. Recent studies demonstrate that large language models (LLMs) possess exceptional sentiment understanding capabilities. However, directly deploying LLMs for FSA applications incurs high inference costs. Therefore, this paper investigates the distillation of fine-grained sentiment understanding from LLMs into small language models (SLMs). We prompt LLMs to examine and interpret the sentiments of given reviews and then utilize the generated content to pretrain SLMs. Additionally, we develop a comprehensive FSA benchmark to evaluate both SLMs and LLMs. Extensive experiments on this benchmark reveal that: (1) distillation significantly enhances the performance of SLMs in FSA tasks, achieving a 6.00\% improvement in $F_{1}$ -score, and the distilled model can outperform Llama-2-7b with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hitsz-hlt/fsa-distillation
pytorchOfficial

Models

Datasets

Gporrt/sentiment-understanding-corpus
dataset· 7 dl
7 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Natural Language Processing Techniques