Missingness Bias Calibration in Feature Attribution Explanations

Shailesh Sridhar; Anton Xue; Eric Wong

arXiv:2603.04831·cs.LG·March 24, 2026

Missingness Bias Calibration in Feature Attribution Explanations

Shailesh Sridhar, Anton Xue, Eric Wong

PDF

Open Access 3 Reviews

TL;DR

This paper introduces MCal, a lightweight post-hoc correction method that effectively reduces missingness bias in feature attribution explanations without retraining models, improving reliability across various domains.

Contribution

The paper challenges the view of missingness bias as a deep flaw and proposes MCal, a simple linear correction method that outperforms complex solutions.

Findings

01

MCal significantly reduces missingness bias in feature importance scores.

02

MCal is competitive with or better than heavyweight approaches.

03

Effective across vision, language, and tabular medical benchmarks.

Abstract

Popular explanation methods often produce unreliable feature importance scores due to missingness bias, a systematic distortion that arises when models are probed with ablated, out-of-distribution inputs. Existing solutions treat this as a deep representational flaw that requires expensive retraining or architectural modifications. In this work, we challenge this assumption and show that missingness bias can be effectively treated as a superficial artifact of the model's output space. We introduce MCal, a lightweight post-hoc method that corrects this bias by fine-tuning a simple linear head on the outputs of a frozen base model. Surprisingly, we find this simple correction consistently reduces missingness bias and is competitive with, or even outperforms, prior heavyweight approaches across diverse medical benchmarks spanning vision, language, and tabular domains.

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 4Confidence 4

Strengths

- The proposed method is straightforward and well-presented, but the analyses and discussions on its impacts lack depth.

Weaknesses

- In fact, if taking a data augmentation perspective, the proposal of this work is not brand new. - While the experiments focus on elaborating how MCal corrects the missingness bias, the results fail to show how the calibration on model outcomes affects explanation quality. There is no qualitative example that illustrates the differences, nor quantitative assessments following standard regimes that effectively compare explainers under different settings.

Reviewer 02Rating 6Confidence 3

Strengths

1) The proposed calibration technique is lightweight, not requiring retraining, access to weights, or architectural modifications of the original model. 2) The proposed technique can be well adapted to common perturbation-based explainability techniques. 3)The paper is generally well written, easy to follow, and the motivation is well presented.

Weaknesses

1)In baseline comparisons, for the replace and retrain categories, the paper uses basic techniques instead of comparing against more advanced techniques in those categories. For example, in the retrain category, some of the recent techniques for imputation, like ROAD, GOAR, are not compared against. Since these are relevant methods to tackle the missingness bias, it would be better to compare them. 2)The paper claims the missingness bias is a superficial artifact, while the original model embed

Reviewer 03Rating 4Confidence 4

Strengths

- This paper is well-motivated and self-contained, with a clear description of the proposed method. - The described plugin is flexible and can be appended to arbitrary models regardless of input modalities or model architectures. - The empirical results demonstrate the capability of MCal in aligning the predictions on manipulated inputs with the standard model outcomes, supporting the method design.

Weaknesses

- The discussion on the central motivation is insufficient. Particularly, line 120 states that “masking non-critical regions …”, yet it is arguable that considering certain regions as “critical” already injects human inductive biases. If the model truly learned to look at a meaningless/wrong region, would this additional “correction” for the missingness bias consequently cover up its problematic behavior? - While the objective of the paper is clearly stated, the line of discussion deviates from

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare