Counterfactual VQA: A Cause-Effect Look at Language Bias

Yulei Niu; Kaihua Tang; Hanwang Zhang; Zhiwu Lu; Xian-Sheng Hua,; Ji-Rong Wen

arXiv:2006.04315·cs.CV·April 2, 2021·23 cites

Counterfactual VQA: A Cause-Effect Look at Language Bias

Yulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua,, Ji-Rong Wen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a counterfactual inference framework for VQA that effectively reduces language bias by disentangling causal effects, improving performance on biased and balanced datasets without data augmentation.

Contribution

It proposes a novel causal inference approach to mitigate language bias in VQA, applicable across various models and fusion strategies.

Findings

01

Improves VQA performance on bias-sensitive datasets

02

Reduces reliance on language priors

03

Maintains robustness on balanced datasets

Abstract

VQA models may tend to rely on language bias as a shortcut and thus fail to sufficiently learn the multi-modal knowledge from both vision and language. Recent debiasing methods proposed to exclude the language prior during inference. However, they fail to disentangle the "good" language context and "bad" language bias from the whole. In this paper, we investigate how to mitigate language bias in VQA. Motivated by causal effects, we proposed a novel counterfactual inference framework, which enables us to capture the language bias as the direct causal effect of questions on answers and reduce the language bias by subtracting the direct language effect from the total causal effect. Experiments demonstrate that our proposed counterfactual inference framework 1) is general to various VQA backbones and fusion strategies, 2) achieves competitive performance on the language-bias sensitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuleiniu/cfvqa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling