Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison

Tejul Pandit; Meet Raval; and Dhvani Upadhyay

arXiv:2505.24701·cs.CL·June 2, 2025

Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison

Tejul Pandit, Meet Raval, and Dhvani Upadhyay

PDF

Open Access

TL;DR

This paper introduces a method to generate synthetic ABSA conversation data using LLMs like GPT-4o, enabling better model evaluation across multiple domains without relying on scarce real-world datasets.

Contribution

It presents a novel LLM-based approach for creating multi-domain ABSA datasets, improving data diversity and supporting comprehensive model assessment.

Findings

01

Synthetic data improves model evaluation robustness

02

Different models show trade-offs in precision and recall

03

GPT-4o effectively generates diverse ABSA data

Abstract

Aspect-Based Sentiment Analysis (ABSA) offers granular insights into opinions but often suffers from the scarcity of diverse, labeled datasets that reflect real-world conversational nuances. This paper presents an approach for generating synthetic ABSA data using Large Language Models (LLMs) to address this gap. We detail the generation process aimed at producing data with consistent topic and sentiment distributions across multiple domains using GPT-4o. The quality and utility of the generated data were evaluated by assessing the performance of three state-of-the-art LLMs (Gemini 1.5 Pro, Claude 3.5 Sonnet, and DeepSeek-R1) on topic and sentiment classification tasks. Our results demonstrate the effectiveness of the synthetic data, revealing distinct performance trade-offs among the models: DeepSeekR1 showed higher precision, Gemini 1.5 Pro and Claude 3.5 Sonnet exhibited strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques