Dataset Condensation for Recommendation
Jiahao Wu, Wenqi Fan, Jingfan Chen, Shengcai Liu, Qijiong, Liu, Rui He, Qing Li, Ke Tang

TL;DR
This paper introduces DConRec, a novel dataset condensation framework for recommendation systems that synthesizes small, informative datasets by modeling discrete user-item interactions and incorporating user preferences, resulting in efficient training.
Contribution
The paper presents a lightweight, probabilistic condensation method tailored for recommendation datasets, addressing limitations of existing approaches in generating discrete interactions and preserving preferences.
Findings
Effective in synthesizing small, informative datasets
Accelerates data synthesis with lightweight policy gradient
Demonstrates superior performance on real-world datasets
Abstract
Training recommendation models on large datasets requires significant time and resources. It is desired to construct concise yet informative datasets for efficient training. Recent advances in dataset condensation show promise in addressing this problem by synthesizing small datasets. However, applying existing methods of dataset condensation to recommendation has limitations: (1) they fail to generate discrete user-item interactions, and (2) they could not preserve users' potential preferences. To address the limitations, we propose a lightweight condensation framework tailored for recommendation (DConRec), focusing on condensing user-item historical interaction sets. Specifically, we model the discrete user-item interactions via a probabilistic approach and design a pre-augmentation module to incorporate the potential preferences of users into the condensed datasets. While the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Stochastic Gradient Optimization Techniques
