Balanced Training Data Augmentation for Aspect-Based Sentiment Analysis

Junjie Liu; Yuanhe Tian; Yan Song

arXiv:2507.09485·cs.CL·July 15, 2025

Balanced Training Data Augmentation for Aspect-Based Sentiment Analysis

Junjie Liu, Yuanhe Tian, Yan Song

PDF

Open Access

TL;DR

This paper introduces a novel LLM-based data augmentation method for aspect-based sentiment analysis that balances training data and improves model performance by optimizing synthetic data quality through reinforcement learning.

Contribution

It proposes a reinforcement learning approach to enhance data augmentation quality in LLM-based ABSA, addressing data imbalance and short text challenges.

Findings

01

Improved ABSA performance over strong baselines.

02

Balanced training data enhances model accuracy.

03

Reinforcement learning optimizes synthetic data quality.

Abstract

Aspect-based sentiment analysis (ABSA) is a crucial fine-grained task in social media scenarios to identify the sentiment polarity of specific aspect terms in a sentence. Although many existing studies leverage large language models (LLMs) to perform ABSA due to their strong context understanding capabilities, they still face challenges to learn the context information in the running text because of the short text, as well as the small and unbalanced labeled training data, where most data are labeled with positive sentiment. Data augmentation (DA) is a feasible strategy for providing richer contextual information, especially when using LLMs to create synthetic training data, but faces challenges in ensuring a high quality of the augmented data.In this paper, we propose an LLM-based ABSA approach with training data augmentation.Specifically, an LLM is prompted to generate augmented…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining