Extending the Scope of Out-of-Domain: Examining QA models in multiple   subdomains

Chenyang Lyu; Jennifer Foster; Yvette Graham

arXiv:2204.04534·cs.CL·April 12, 2022

Extending the Scope of Out-of-Domain: Examining QA models in multiple subdomains

Chenyang Lyu, Jennifer Foster, Yvette Graham

PDF

Open Access 1 Repo

TL;DR

This paper investigates how question answering models perform across various subdomains defined by internal dataset characteristics, revealing significant performance drops when training and testing data differ in these subdomains.

Contribution

It introduces a new perspective by analyzing QA model performance across subdomains based on internal dataset features, highlighting the limitations of current generalization assumptions.

Findings

01

Performance drops when train and test data are from different subdomains.

02

Internal dataset characteristics significantly impact QA system accuracy.

03

Current QA models may not generalize well across diverse subdomains.

Abstract

Past works that investigate out-of-domain performance of QA systems have mainly focused on general domains (e.g. news domain, wikipedia domain), underestimating the importance of subdomains defined by the internal characteristics of QA datasets. In this paper, we extend the scope of "out-of-domain" by splitting QA examples into different subdomains according to their several internal characteristics including question type, text length, answer position. We then examine the performance of QA systems trained on the data from different subdomains. Experimental results show that the performance of QA systems can be significantly reduced when the train data and test data come from different subdomains. These results question the generalizability of current QA systems in multiple subdomains, suggesting the need to combat the bias introduced by the internal characteristics of QA datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lyuchenyang/analysing-question-answering-data
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research