Deep Attention Reweighting: Post-Hoc Attention-Based Feature Aggregation in CNNs for Disentangling Core and Spurious Features under Spurious Correlations

Kin Whye Chew; Jingxian Wang

arXiv:2605.20732·cs.CV·May 21, 2026

Deep Attention Reweighting: Post-Hoc Attention-Based Feature Aggregation in CNNs for Disentangling Core and Spurious Features under Spurious Correlations

Kin Whye Chew, Jingxian Wang

PDF

TL;DR

This paper introduces Deep Attention Reweighting (DAR), a post-hoc attention-based method that replaces GAP in CNNs to better disentangle core features from spurious correlations, improving generalization.

Contribution

The paper proposes DAR, an attention mechanism that enhances feature disentanglement in CNNs by addressing GAP-induced entanglement, outperforming previous methods like DFR.

Findings

01

DAR consistently outperforms DFR across datasets and metrics.

02

Attention-based aggregation reduces reliance on spurious features.

03

Replacing GAP with DAR improves model generalization.

Abstract

Convolutional Neural Networks (CNNs) often exploit spurious correlations in datasets, learning superficially predictive yet causally irrelevant features, leading to poor generalization and fairness issues. Deep Feature Reweighting (DFR) is a post-hoc technique that reduces a trained model's reliance on spurious correlations by retraining its classification head on a target dataset. However, we show that DFR is fundamentally constrained by operating on entangled features, limiting its ability to amplify the core features while simultaneously suppressing the spurious ones. We trace this entanglement to the ubiquitous Global Average Pooling (GAP) layer, which indiscriminately collapses spatially distinct core and spurious features into a single representation. To address this, we propose Deep Attention Reweighting (DAR), a post-hoc attention-based aggregation module that replaces GAP and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.