Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models

Ivan Luiz De Moura Matos; Abdel Djalil Sad Saoud; Ekaterina Iakovleva; Vito Paolo Pastore; Enzo Tartaglione

arXiv:2603.05582·cs.LG·May 14, 2026

Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models

Ivan Luiz De Moura Matos, Abdel Djalil Sad Saoud, Ekaterina Iakovleva, Vito Paolo Pastore, Enzo Tartaglione

PDF

TL;DR

This paper presents BISE, a method to extract bias-free subnetworks from vanilla-trained models through pruning, enabling bias mitigation without retraining or additional data, and demonstrating efficiency and effectiveness.

Contribution

Introducing BISE, a novel pruning-based technique to identify and extract unbiased subnetworks from standard models without retraining or dataset manipulation.

Findings

01

Subnetworks can be extracted via pruning without retraining.

02

Extracted subnetworks operate with less biased features.

03

Approach improves efficiency and maintains robust performance.

Abstract

The issue of algorithmic biases in deep learning has led to the development of various debiasing techniques, many of which perform complex training procedures or dataset manipulation. However, an intriguing question arises: is it possible to extract fair and bias-agnostic subnetworks from standard vanilla-trained models without relying on additional data, such as unbiased training set? In this work, we introduce Bias-Invariant Subnetwork Extraction (BISE), a learning strategy that identifies and isolates "bias-free" subnetworks that already exist within conventionally trained models, without retraining or finetuning the original parameters. Our approach demonstrates that such subnetworks can be extracted via pruning and can operate without modification, effectively relying less on biased features and maintaining robust performance. Our findings contribute towards efficient bias…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.