TF-DCon: Leveraging Large Language Models (LLMs) to Empower   Training-Free Dataset Condensation for Content-Based Recommendation

Jiahao Wu; Qijiong Liu; Hengchang Hu; Wenqi Fan; Shengcai Liu; Qing; Li; Xiao-Ming Wu; Ke Tang

arXiv:2310.09874·cs.IR·February 11, 2025·1 cites

TF-DCon: Leveraging Large Language Models (LLMs) to Empower Training-Free Dataset Condensation for Content-Based Recommendation

Jiahao Wu, Qijiong Liu, Hengchang Hu, Wenqi Fan, Shengcai Liu, Qing, Li, Xiao-Ming Wu, Ke Tang

PDF

Open Access

TL;DR

This paper introduces TF-DCon, a training-free dataset condensation method for content-based recommendation that uses large language models to synthesize small, informative datasets, maintaining high performance with significantly reduced data size.

Contribution

The paper proposes a novel, non-iterative condensation approach leveraging LLMs for content and user data, specifically tailored for textual content-based recommendation tasks.

Findings

01

Achieves up to 97% of original performance on condensed datasets.

02

Reduces dataset size by 95% while maintaining high recommendation accuracy.

03

Demonstrates effectiveness across three real-world datasets.

Abstract

Modern techniques in Content-based Recommendation (CBR) leverage item content information to provide personalized services to users, but suffer from resource-intensive training on large datasets. To address this issue, we explore the dataset condensation for textual CBR in this paper. The goal of dataset condensation is to synthesize a small yet informative dataset, upon which models can achieve performance comparable to those trained on large datasets. While existing condensation approaches are tailored to classification tasks for continuous data like images or embeddings, direct application of them to CBR has limitations. To bridge this gap, we investigate efficient dataset condensation for content-based recommendation. Inspired by the remarkable abilities of large language models (LLMs) in text comprehension and generation, we leverage LLMs to empower the generation of textual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Recommender Systems and Techniques · Multimodal Machine Learning Applications