Iterative Data Generation with Large Language Models for Aspect-based Sentiment Analysis
Qihuang Zhong, Haiyun Li, Luyao Zhuang, Juhua Liu, Bo Du

TL;DR
This paper introduces IDG, an iterative data generation framework leveraging large language models to produce high-quality, diverse pseudo-labeled data for aspect-based sentiment analysis, significantly improving model performance.
Contribution
The paper proposes a novel iterative data generation method with self-reflection filtering to enhance LLM-generated data quality for ABSA tasks, addressing issues of fluency, diversity, and hallucinations.
Findings
IDG improves performance across multiple ABSA benchmarks.
Synthetic data from IDG matches or exceeds manual annotations.
IDG demonstrates consistent gains over baseline models.
Abstract
Aspect-based Sentiment Analysis (ABSA) is an important sentiment analysis task, which aims to determine the sentiment polarity towards an aspect in a sentence. Due to the expensive and limited labeled data, data generation (DG) has become the standard for improving the performance of ABSA. However, current DG methods usually have some shortcomings: 1) poor fluency and coherence, 2) lack of diversity of generated data, and 3) reliance on some existing labeled data, hindering its applications in real-world scenarios. With the advancement of large language models (LLMs), LLM-based DG has the potential to solve the above issues. Unfortunately, directly prompting LLMs struggles to generate the desired pseudo-label ABSA data, as LLMs are prone to hallucinations, leading to undesired data generation. To this end, we propose a systematic Iterative Data Generation framework, namely IDG, to boost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Web Data Mining and Analysis · Advanced Text Analysis Techniques
