Neural Corrective Machine Unranking
Jingrui Hou, Axel Finke, Georgina Cosma

TL;DR
This paper introduces CuRD, a novel teacher-student framework for neural IR unranking that effectively forgets specific data while preserving overall retrieval performance and integrity.
Contribution
It formalizes corrective unranking in neural IR and proposes CuRD, which improves data forgetting and correction without sacrificing model accuracy.
Findings
CuRD outperforms seven baselines in forgetting and correction.
It maintains model performance on non-forgotten data.
Effective across multiple neural IR models and datasets.
Abstract
Machine unlearning in neural information retrieval (IR) systems requires removing specific data whilst maintaining model performance. Applying existing machine unlearning methods to IR may compromise retrieval effectiveness or inadvertently expose unlearning actions due to the removal of particular items from the retrieved results presented to users. We formalise corrective unranking, which extends machine unlearning in (neural) IR context by integrating substitute documents to preserve ranking integrity, and propose a novel teacher-student framework, Corrective unRanking Distillation (CuRD), for this task. CuRD (1) facilitates forgetting by adjusting the (trained) neural IR model such that its output relevance scores of to-be-forgotten samples mimic those of low-ranking, non-retrievable samples; (2) enables correction by fine-tuning the relevance scores for the substitute samples to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning
MethodsSparse Evolutionary Training
