ERASE -- A Real-World Aligned Benchmark for Unlearning in Recommender Systems
Pierre Lubitzsch, Maarten de Rijke, Sebastian Schelter

TL;DR
ERASE is a comprehensive benchmark designed to evaluate machine unlearning methods in real-world recommender systems, addressing practical constraints and diverse scenarios to advance the field.
Contribution
It introduces ERASE, a large-scale, realistic benchmark covering multiple tasks, datasets, and algorithms, with extensive artifacts for systematic evaluation of unlearning methods.
Findings
Approximate unlearning can match retraining in some cases.
Robustness of unlearning varies across datasets and models.
Recommender-specific methods are more reliable than general-purpose ones.
Abstract
Machine unlearning (MU) enables the removal of selected training data from trained models, to address privacy compliance, security, and liability issues in recommender systems. Existing MU benchmarks poorly reflect real-world recommender settings: they focus primarily on collaborative filtering, assume unrealistically large deletion requests, and overlook practical constraints such as sequential unlearning and efficiency. We present ERASE, a large-scale benchmark for MU in recommender systems designed to align with real-world usage. ERASE spans three core tasks -- collaborative filtering, session-based recommendation, and next-basket recommendation -- and includes unlearning scenarios inspired by real-world applications, such as sequentially removing sensitive interactions or spam. The benchmark covers seven unlearning algorithms, including general-purpose and recommender-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Recommender Systems and Techniques · Adversarial Robustness in Machine Learning
