TF-DCon: Leveraging Large Language Models (LLMs) to Empower Training-Free Dataset Condensation for Content-Based Recommendation
Jiahao Wu, Qijiong Liu, Hengchang Hu, Wenqi Fan, Shengcai Liu, Qing, Li, Xiao-Ming Wu, Ke Tang

TL;DR
This paper introduces TF-DCon, a training-free dataset condensation method for content-based recommendation that uses large language models to synthesize small, informative datasets, maintaining high performance with significantly reduced data size.
Contribution
The paper proposes a novel, non-iterative condensation approach leveraging LLMs for content and user data, specifically tailored for textual content-based recommendation tasks.
Findings
Achieves up to 97% of original performance on condensed datasets.
Reduces dataset size by 95% while maintaining high recommendation accuracy.
Demonstrates effectiveness across three real-world datasets.
Abstract
Modern techniques in Content-based Recommendation (CBR) leverage item content information to provide personalized services to users, but suffer from resource-intensive training on large datasets. To address this issue, we explore the dataset condensation for textual CBR in this paper. The goal of dataset condensation is to synthesize a small yet informative dataset, upon which models can achieve performance comparable to those trained on large datasets. While existing condensation approaches are tailored to classification tasks for continuous data like images or embeddings, direct application of them to CBR has limitations. To bridge this gap, we investigate efficient dataset condensation for content-based recommendation. Inspired by the remarkable abilities of large language models (LLMs) in text comprehension and generation, we leverage LLMs to empower the generation of textual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Multimodal Machine Learning Applications
