Counterfactual Generative Networks

Axel Sauer; Andreas Geiger

arXiv:2101.06046·cs.LG·January 18, 2021·29 cites

Counterfactual Generative Networks

Axel Sauer, Andreas Geiger

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a generative model that disentangles causal factors in images, enabling the creation of counterfactual images to improve robustness and interpretability in classifiers.

Contribution

It proposes a causal generative framework that decomposes images into independent mechanisms, allowing for counterfactual image generation without supervision.

Findings

01

Counterfactual images improve out-of-distribution robustness.

02

Model can generate images on MNIST and ImageNet.

03

Efficient training on a single GPU using pre-trained models.

Abstract

Neural networks are prone to learning shortcuts -- they often model simple correlations, ignoring more complex ones that potentially generalize better. Prior works on image classification show that instead of learning a connection to object shape, deep classifiers tend to exploit spurious correlations with low-level texture or the background for solving the classification task. In this work, we take a step towards more robust and interpretable classifiers that explicitly expose the task's causal structure. Building on current advances in deep generative modeling, we propose to decompose the image generation process into independent causal mechanisms that we train without direct supervision. By exploiting appropriate inductive biases, these mechanisms disentangle object shape, object texture, and background; hence, they allow for generating counterfactual images. We demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

autonomousvision/counterfactual_generative_networks
pytorchOfficial

Videos

Counterfactual Generative Networks· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning