Improving QA Generalization by Concurrent Modeling of Multiple Biases

Mingzhu Wu; Nafise Sadat Moosavi; Andreas R\"uckl\'e; Iryna; Gurevych

arXiv:2010.03338·cs.CL·October 8, 2020·1 cites

Improving QA Generalization by Concurrent Modeling of Multiple Biases

Mingzhu Wu, Nafise Sadat Moosavi, Andreas R\"uckl\'e, Iryna, Gurevych

PDF

Open Access 1 Repo

TL;DR

This paper introduces a framework that improves question answering model generalization by concurrently modeling and weighting multiple biases in training data, reducing reliance on biased examples.

Contribution

It proposes a novel bias-weighting training framework that enhances in-domain and out-of-domain QA performance by addressing multiple biases simultaneously.

Findings

01

Effective in both single-domain and multi-domain settings.

02

Outperforms state-of-the-art debiasing methods.

03

Improves generalization to out-of-domain datasets.

Abstract

Existing NLP datasets contain various biases that models can easily exploit to achieve high performances on the corresponding evaluation sets. However, focusing on dataset-specific biases limits their ability to learn more generalizable knowledge about the task from more general data patterns. In this paper, we investigate the impact of debiasing methods for improving generalization and propose a general framework for improving the performance on both in-domain and out-of-domain datasets by concurrent modeling of multiple biases in the training data. Our framework weights each example based on the biases it contains and the strength of those biases in the training data. It then uses these weights in the training objective so that the model relies less on examples with high bias weights. We extensively evaluate our framework on extractive question answering with training data from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UKPLab/qa-generalization-concurrent-debiasing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques