Balanced Training Data Augmentation for Aspect-Based Sentiment Analysis
Junjie Liu, Yuanhe Tian, Yan Song

TL;DR
This paper introduces a novel LLM-based data augmentation method for aspect-based sentiment analysis that balances training data and improves model performance by optimizing synthetic data quality through reinforcement learning.
Contribution
It proposes a reinforcement learning approach to enhance data augmentation quality in LLM-based ABSA, addressing data imbalance and short text challenges.
Findings
Improved ABSA performance over strong baselines.
Balanced training data enhances model accuracy.
Reinforcement learning optimizes synthetic data quality.
Abstract
Aspect-based sentiment analysis (ABSA) is a crucial fine-grained task in social media scenarios to identify the sentiment polarity of specific aspect terms in a sentence. Although many existing studies leverage large language models (LLMs) to perform ABSA due to their strong context understanding capabilities, they still face challenges to learn the context information in the running text because of the short text, as well as the small and unbalanced labeled training data, where most data are labeled with positive sentiment. Data augmentation (DA) is a feasible strategy for providing richer contextual information, especially when using LLMs to create synthetic training data, but faces challenges in ensuring a high quality of the augmented data.In this paper, we propose an LLM-based ABSA approach with training data augmentation.Specifically, an LLM is prompted to generate augmented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining
