Detecting labeling bias using influence functions

Frida Jørgensen; Nina Weng; Siavash Bigdeli

arXiv:2602.19130 · cs.LG · February 24, 2026

Open Access

TL;DR

This paper explores the use of influence functions to detect labeling bias and mislabeled samples in datasets, demonstrating promising results on MNIST and CheXpert datasets.

Contribution

It introduces a novel pipeline leveraging influence functions to identify labeling errors, especially in complex datasets like CheXpert.

Findings

01

Successfully detected nearly 90% of mislabeled samples in MNIST.

02

Mislabeled samples in CheXpert show higher influence scores.

03

Influence functions can effectively reveal label errors in real-world datasets.

Abstract

Labeling bias arises during data collection due to resource limitations or unconscious bias, leading to unequal label error rates across subgroups or misrepresentation of subgroup prevalence. Most fairness constraints assume that training labels reflect the true distribution, which renders them ineffective when labeling bias is present. This leaves a challenging question: how can we detect such labeling bias? In this work, we investigate whether influence functions can be used to detect labeling bias. Influence functions estimate how much each training sample affects a model's predictions by leveraging the gradient and Hessian of the loss function. When labeling errors occur, influence functions can identify wrongly labeled samples in the training set, revealing the underlying failure mode. We develop a sample valuation pipeline and test it first on the MNIST dataset, then scaled to…
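The gradient-and-Hessian idea in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes a small L2-regularised logistic regression on synthetic two-class data (standing in for MNIST or CheXpert), and the helper names `fit_logreg` and `self_influence` are hypothetical. Each training sample is scored by its self-influence magnitude, g_i^T H^{-1} g_i, where g_i is the per-sample loss gradient and H is the Hessian of the training objective; samples with deliberately flipped labels are expected to rank near the top.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def hessian(X, p, l2):
    """Hessian of the mean log-loss plus an L2 penalty (assumed objective)."""
    n, d = X.shape
    return (X.T * (p * (1.0 - p))) @ X / n + l2 * np.eye(d)

def fit_logreg(X, y, l2=1e-2, steps=25):
    """Fit logistic regression with Newton's method."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(steps):
        p = sigmoid(X @ w)
        grad = X.T @ (p - y) / n + l2 * w
        w -= np.linalg.solve(hessian(X, p, l2), grad)
    return w

def self_influence(X, y, w, l2=1e-2):
    """Score each training sample by g_i^T H^{-1} g_i."""
    p = sigmoid(X @ w)
    G = (p - y)[:, None] * X                       # per-sample log-loss gradients
    HinvG = np.linalg.solve(hessian(X, p, l2), G.T).T
    return np.einsum("ij,ij->i", G, HinvG)

# Synthetic data: two Gaussian blobs plus a bias column (illustrative only).
rng = np.random.default_rng(0)
n, n_flipped = 400, 40
y_true = (np.arange(n) % 2).astype(float)
centers = np.where(y_true[:, None] == 1, 2.0, -2.0)
X = np.hstack([centers + rng.normal(size=(n, 2)), np.ones((n, 1))])

# Inject label errors by flipping 10% of the labels.
flipped = rng.choice(n, size=n_flipped, replace=False)
y_noisy = y_true.copy()
y_noisy[flipped] = 1.0 - y_noisy[flipped]

w = fit_logreg(X, y_noisy)
scores = self_influence(X, y_noisy, w)

# Flag the highest-scoring samples and measure how many flips were recovered.
top_k = np.argsort(scores)[::-1][:n_flipped]
recall = np.isin(top_k, flipped).mean()
print(f"fraction of flipped labels among the top {n_flipped} scores: {recall:.2f}")
```

For deep models the Hessian cannot be formed and inverted exactly as it is here, so practical implementations typically rely on approximations of the inverse-Hessian-vector product; the exact solve above is a choice made for the toy setting only.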

Taxonomy

Topics: Machine Learning and Data Classification · Imbalanced Data Classification Techniques · Explainable Artificial Intelligence (XAI)