Dataset Regeneration for Sequential Recommendation
Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao,, Defu Lian, Enhong Chen

TL;DR
This paper introduces a data-centric approach for sequential recommendation systems, focusing on regenerating high-quality datasets that improve model performance across multiple architectures and datasets.
Contribution
The paper proposes DR4SR, a model-agnostic dataset regeneration framework, and DR4SR+, a model-aware personalizer, advancing data-centric AI in sequential recommendation.
Findings
Significant performance improvements across four datasets.
Frameworks are compatible with various model-centric methods.
In-depth analysis supports the potential of data-centric paradigm.
Abstract
The sequential recommender (SR) system is a crucial component of modern recommender systems, as it aims to capture the evolving preferences of users. Significant efforts have been made to enhance the capabilities of SR systems. These methods typically follow the model-centric paradigm, which involves developing effective models based on fixed datasets. However, this approach often overlooks potential quality issues and flaws inherent in the data. Driven by the potential of data-centric AI, we propose a novel data-centric paradigm for developing an ideal training dataset using a model-agnostic dataset regeneration framework called DR4SR. This framework enables the regeneration of a dataset with exceptional cross-architecture generalizability. Additionally, we introduce the DR4SR+ framework, which incorporates a model-aware dataset personalizer to tailor the regenerated dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data Technologies and Applications · Recommender Systems and Techniques
