Selective Question Answering under Domain Shift

Amita Kamath; Robin Jia; Percy Liang

arXiv:2006.09462 · cs.CL · June 18, 2020 · 5 cites

TL;DR

This paper introduces a method for selective question answering under domain shift, using a calibrator to improve abstention decisions and maintain high accuracy on mixed in-domain and out-of-domain data.

Contribution

It proposes a calibrator-based approach that leverages out-of-domain data to better identify errors, enhancing abstention policies in QA models under domain shift.

Findings

01. Answers 56% of questions at 80% accuracy

02. Outperforms probability-based abstention methods

03. Effective on multiple QA datasets
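
A figure such as "answers 56% of questions at 80% accuracy" is a coverage-at-accuracy measurement. The sketch below shows one plausible way to compute it, assuming each question comes with a confidence score and a correctness label; the function name `coverage_at_accuracy` and the toy data are illustrative and not taken from the paper.

```python
from typing import Sequence


def coverage_at_accuracy(confidences: Sequence[float],
                         correct: Sequence[bool],
                         target_accuracy: float) -> float:
    """Largest fraction of questions that can be answered, most confident
    first, while accuracy on the answered questions stays >= target.

    Assumes confidence scores are distinct, so every prefix of the ranking
    corresponds to some abstention threshold.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        return 0.0
    # Rank questions from most to least confident.
    order = sorted(range(n), key=lambda i: confidences[i], reverse=True)
    best = 0          # largest number of answered questions meeting the target
    num_correct = 0
    for answered, i in enumerate(order, start=1):
        num_correct += int(correct[i])
        if num_correct / answered >= target_accuracy:
            best = answered
    return best / n


# Toy data (made up): five questions, three answered correctly.
scores = [0.9, 0.8, 0.7, 0.6, 0.5]
labels = [True, True, False, True, False]
print(coverage_at_accuracy(scores, labels, 0.75))  # 0.8
```

Any abstention method can be compared this way: a better confidence score ranks errors lower and so yields higher coverage at the same target accuracy.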

Abstract

To avoid giving wrong answers, question answering (QA) models need to know when to abstain from answering. Moreover, users often ask questions that diverge from the model's training data, making errors more likely and thus abstention more critical. In this work, we propose the setting of selective question answering under domain shift, in which a QA model is tested on a mixture of in-domain and out-of-domain data, and must answer (i.e., not abstain on) as many questions as possible while maintaining high accuracy. Abstention policies based solely on the model's softmax probabilities fare poorly, since models are overconfident on out-of-domain inputs. Instead, we train a calibrator to identify inputs on which the QA model errs, and abstain when it predicts an error is likely. Crucially, the calibrator benefits from observing the model's behavior on out-of-domain data, even if from a…
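
The calibrator idea in the abstract can be illustrated with a minimal sketch: fit a small classifier that predicts whether the QA model's answer is correct, then abstain when the predicted probability of correctness falls below a threshold. Everything concrete here is an assumption made for illustration: the logistic-regression calibrator is a stand-in (the excerpt does not say which model the authors use), and the two features (the QA model's maximum softmax probability and a hypothetical "unfamiliarity" score) and all data values are invented.

```python
import math
from typing import List, Sequence


def _sigmoid(z: float) -> float:
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_correct_prob(w: Sequence[float], x: Sequence[float]) -> float:
    """Calibrator's estimate that the QA model's answer is correct.
    The last entry of w is the bias term."""
    return _sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + w[-1])


def train_calibrator(features: Sequence[Sequence[float]],
                     is_correct: Sequence[bool],
                     lr: float = 1.0,
                     epochs: int = 5000) -> List[float]:
    """Fit logistic-regression weights by batch gradient descent."""
    dim = len(features[0])
    n = len(features)
    w = [0.0] * (dim + 1)
    for _ in range(epochs):
        grad = [0.0] * (dim + 1)
        for x, y in zip(features, is_correct):
            err = predict_correct_prob(w, x) - float(y)
            for j in range(dim):
                grad[j] += err * x[j]
            grad[-1] += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w


def answer_or_abstain(w: Sequence[float], x: Sequence[float],
                      threshold: float) -> str:
    """Abstain when the calibrator predicts an error is likely."""
    return "answer" if predict_correct_prob(w, x) >= threshold else "abstain"


# Hypothetical features: [max softmax probability, unfamiliarity score].
# The last two rows mimic out-of-domain inputs on which the QA model is
# confident but wrong.
train_x = [[0.90, 0.10], [0.80, 0.20], [0.95, 0.15],
           [0.30, 0.10], [0.90, 0.90], [0.85, 0.80]]
train_y = [True, True, True, False, False, False]

weights = train_calibrator(train_x, train_y)
print(answer_or_abstain(weights, [0.90, 0.10], 0.5))  # answer
print(answer_or_abstain(weights, [0.90, 0.90], 0.5))  # abstain
```

In this toy setup both test inputs have the same softmax probability of 0.90, so thresholding softmax alone could not tell them apart; the calibrator separates them only because its training data included confidently wrong out-of-domain examples.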

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

Methods: Softmax