Beyond Single Labels: Improving Conversational Recommendation through LLM-Powered Data Augmentation

Haozhe Xu; Xiaohua Wang; Changze Lv; Xiaoqing Zheng

arXiv:2508.05657·cs.IR·August 11, 2025

TL;DR

This paper introduces a novel LLM-powered data augmentation framework for conversational recommender systems, improving recommendation accuracy by balancing semantic relevance and collaborative information.

Contribution

It proposes a two-stage data augmentation and training strategy leveraging LLMs to enhance CRS performance, addressing false negatives and data noise issues.

Findings

1. Significant performance improvements on benchmark datasets.

2. Effective balancing of semantic relevance and collaborative signals.

3. Robustness demonstrated across multiple recommender models.

Abstract

Conversational recommender systems (CRSs) enhance recommendation quality by engaging users in multi-turn dialogues, capturing nuanced preferences through natural language interactions. However, these systems often face the false negative issue, where items that a user might like are incorrectly labeled as negative during training, leading to suboptimal recommendations. Expanding the label set through data augmentation presents an intuitive solution but faces the challenge of balancing two key aspects: ensuring semantic relevance and preserving the collaborative information inherent in CRS datasets. To address these issues, we propose a novel data augmentation framework that first leverages an LLM-based semantic retriever to identify diverse and semantically relevant items, which are then filtered by a relevance scorer to remove noisy candidates. Building on this, we introduce a two-stage…
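
The retrieve-then-filter idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: cosine similarity over toy vectors stands in for the LLM-based semantic retriever, a lookup table stands in for the relevance scorer, and every name, vector, score, and threshold below (`retrieve_candidates`, `filter_candidates`, `augment_labels`, the item titles) is a hypothetical chosen for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def retrieve_candidates(dialogue_vec, item_vecs, k):
    """Stand-in for the semantic retriever: top-k items by cosine similarity."""
    ranked = sorted(
        item_vecs,
        key=lambda item: cosine(dialogue_vec, item_vecs[item]),
        reverse=True,
    )
    return ranked[:k]


def filter_candidates(candidates, score_fn, threshold):
    """Stand-in for the relevance scorer: drop candidates scored below threshold."""
    return [item for item in candidates if score_fn(item) >= threshold]


def augment_labels(gold_item, dialogue_vec, item_vecs, score_fn, k=3, threshold=0.5):
    """Expand the single gold label with retrieved candidates that survive filtering."""
    candidates = retrieve_candidates(dialogue_vec, item_vecs, k)
    kept = filter_candidates(candidates, score_fn, threshold)
    return [gold_item] + [item for item in kept if item != gold_item]


# Hypothetical toy data: 2-d "embeddings" and hand-set relevance scores.
item_vecs = {
    "Alien": [1.0, 0.0],
    "Aliens": [0.9, 0.1],
    "Notting Hill": [0.0, 1.0],
    "Event Horizon": [0.8, 0.3],
}
dialogue_vec = [1.0, 0.05]
toy_scores = {"Alien": 0.9, "Aliens": 0.8, "Event Horizon": 0.3, "Notting Hill": 0.1}

labels = augment_labels("Alien", dialogue_vec, item_vecs, toy_scores.get)
print(labels)
```

In this toy run the retriever proposes three candidates and the scorer rejects one as noise, so the single gold label grows into a two-item label set. How the augmented labels are then used in training is not shown, since the abstract is truncated at that point.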

Taxonomy

Topics: Recommender Systems and Techniques · Natural Language Processing Techniques · Mathematics, Computing, and Information Processing