Single-weight Model Editing for Post-hoc Spurious Correlation Neutralization
Shahin Hakemi, Naveed Akhtar, Ghulam Mubashar Hassan, Ajmal Mian

TL;DR
This paper introduces a novel post-hoc method to neutralize spurious correlations in neural networks by editing a single weight, effectively reducing reliance on misleading features with minimal performance loss.
Contribution
The paper presents a unique single-weight modification technique for post-hoc spurious feature neutralization, offering a practical and efficient alternative to existing methods.
Findings
Single-weight editing effectively neutralizes spurious features.
Method achieves competitive or superior performance.
Negligible impact on overall model accuracy.
Abstract
Neural network training tends to exploit the simplest features as shortcuts to greedily minimize training loss. However, some of these features might be spuriously correlated with the target labels, leading to incorrect predictions by the model. Several methods have been proposed to address this issue. Focusing on suppressing the spurious correlations with model training, they not only incur additional training cost, but also have limited practical utility as the model misbehavior due to spurious relations is usually discovered after its deployment. It is also often overlooked that spuriousness is a subjective notion. Hence, the precise questions that must be investigated are; to what degree a feature is spurious, and how we can proportionally distract the model's attention from it for reliable prediction. To this end, we propose a method that enables post-hoc neutralization of spurious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Adaptive Filtering Techniques · Blind Source Separation Techniques · Wireless Communication Networks Research
MethodsSoftmax · Attention Is All You Need
