Causality-aware counterfactual confounding adjustment for feature representations learned by deep models
Elias Chaibub Neto

TL;DR
This paper introduces a causality-aware method to deconfound feature representations learned by deep neural networks, improving robustness against dataset biases and shifts by leveraging linear models for deconfounding.
Contribution
It adapts counterfactual deconfounding techniques from linear causal models to deep neural network features, enabling effective bias mitigation.
Findings
Effective in reducing confounding effects on deep features
Improves model stability under dataset shifts
Validated on colored MNIST datasets
Abstract
Causal modeling has been recognized as a potential solution to many challenging problems in machine learning (ML). Here, we describe how a recently proposed counterfactual approach developed to deconfound linear structural causal models can still be used to deconfound the feature representations learned by deep neural network (DNN) models. The key insight is that by training an accurate DNN using softmax activation at the classification layer, and then adopting the representation learned by the last layer prior to the output layer as our features, we have that, by construction, the learned features will fit well a (multi-class) logistic regression model, and will be linearly associated with the labels. As a consequence, deconfounding approaches based on simple linear models can be used to deconfound the feature representations learned by DNNs. We validate the proposed methodology using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Domain Adaptation and Few-Shot Learning · Statistical Methods and Inference
MethodsLogistic Regression · Softmax
