Neural Machine Unranking
Jingrui Hou, Axel Finke, Georgina Cosma

TL;DR
This paper introduces Neural Machine UnRanking (NuMuR), a novel approach for data removal in neural IR systems, addressing challenges of unnormalised relevance scores and entangled data, using a dual-objective contrastive and consistent loss framework.
Contribution
The paper proposes CoCoL, a new loss framework for effective data unlearning in neural IR, overcoming limitations of traditional methods for unranking tasks.
Findings
CoCoL achieves significant forgetting with minimal impact on retained data.
The method outperforms existing unlearning techniques on MS MARCO and TREC datasets.
Effective data removal is demonstrated across four neural IR models.
Abstract
We address the problem of machine unlearning in neural information retrieval (IR), introducing a novel task termed Neural Machine UnRanking (NuMuR). This problem is motivated by growing demands for data privacy compliance and selective information removal in neural IR systems. Existing task- or model- agnostic unlearning approaches, primarily designed for classification tasks, are suboptimal for NuMuR due to two core challenges: (1) neural rankers output unnormalised relevance scores rather than probability distributions, limiting the effectiveness of traditional teacher-student distillation frameworks; and (2) entangled data scenarios, where queries and documents appear simultaneously across both forget and retain sets, may degrade retention performance in existing methods. To address these issues, we propose Contrastive and Consistent Loss (CoCoL), a dual-objective framework. CoCoL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification
