Introspective Distillation for Robust Question Answering

Yulei Niu; Hanwang Zhang

arXiv:2111.01026·cs.CV·November 2, 2021·33 cites

Introspective Distillation for Robust Question Answering

Yulei Niu, Hanwang Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Introspective Distillation, a novel debiasing technique for question answering models that balances out-of-distribution robustness with in-distribution accuracy by distinguishing between factual and counterfactual training samples.

Contribution

The paper proposes a new debiasing method called IntroD that blends inductive biases for both OOD and ID data through introspection, improving QA model robustness.

Findings

01

IntroD maintains competitive OOD performance.

02

IntroD improves ID performance over non-debiasing methods.

03

IntroD effectively balances robustness and accuracy.

Abstract

Question answering (QA) models are well-known to exploit data bias, e.g., the language prior in visual QA and the position bias in reading comprehension. Recent debiasing methods achieve good out-of-distribution (OOD) generalizability with a considerable sacrifice of the in-distribution (ID) performance. Therefore, they are only applicable in domains where the test distribution is known in advance. In this paper, we present a novel debiasing method called Introspective Distillation (IntroD) to make the best of both worlds for QA. Our key technical contribution is to blend the inductive bias of OOD and ID by introspecting whether a training sample fits in the factual ID world or the counterfactual OOD one. Experiments on visual QA datasets VQA v2, VQA-CP, and reading comprehension dataset SQuAD demonstrate that our proposed IntroD maintains the competitive OOD performance compared to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuleiniu/introd
pytorchOfficial

Videos

Introspective Distillation for Robust Question Answering· slideslive

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling

MethodsTest