Semantic-preserved Augmentation with Confidence-weighted Fine-tuning for Aspect Category Sentiment Analysis
Yaping Chai, Haoran Xie, Joe S. Qin

TL;DR
This paper presents a novel data augmentation and fine-tuning approach for aspect category sentiment analysis using large language models, improving semantic consistency and prediction confidence to achieve state-of-the-art results.
Contribution
It introduces a structured prompt-based augmentation method with semantic preservation and a confidence-weighted fine-tuning strategy for improved sentiment analysis.
Findings
Achieves superior performance on four benchmark datasets.
Enhances semantic coverage and consistency in augmented data.
Improves model confidence and accuracy in sentiment predictions.
Abstract
Large language model (LLM) is an effective approach to addressing data scarcity in low-resource scenarios. Recent existing research designs hand-crafted prompts to guide LLM for data augmentation. We introduce a data augmentation strategy for the aspect category sentiment analysis (ACSA) task that preserves the original sentence semantics and has linguistic diversity, specifically by providing a structured prompt template for an LLM to generate predefined content. In addition, we employ a post-processing technique to further ensure semantic consistency between the generated sentence and the original sentence. The augmented data increases the semantic coverage of the training distribution, enabling the model better to understand the relationship between aspect categories and sentiment polarities, enhancing its inference capabilities. Furthermore, we propose a confidence-weighted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Text and Document Classification Technologies
