Towards Robust Extractive Question Answering Models: Rethinking the   Training Methodology

Son Quoc Tran; Matt Kretchmar

arXiv:2409.19766·cs.CL·October 1, 2024

Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology

Son Quoc Tran, Matt Kretchmar

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new training approach for extractive question answering models that enhances robustness to distribution shifts and adversarial attacks without sacrificing in-domain accuracy.

Contribution

It proposes a novel loss function and challenges an implicit assumption in existing datasets, leading to more robust EQA models with improved out-of-domain and adversarial performance.

Findings

01

Overall F1 score improved by 5.7 points

02

Enhanced robustness against adversarial attacks

03

Maintains in-domain performance

Abstract

This paper proposes a novel training method to improve the robustness of Extractive Question Answering (EQA) models. Previous research has shown that existing models, when trained on EQA datasets that include unanswerable questions, demonstrate a significant lack of robustness against distribution shifts and adversarial attacks. Despite this, the inclusion of unanswerable questions in EQA training datasets is essential for ensuring real-world reliability. Our proposed training method includes a novel loss function for the EQA problem and challenges an implicit assumption present in numerous EQA datasets. Models trained with our method maintain in-domain performance while achieving a notable improvement on out-of-domain datasets. This results in an overall F1 score improvement of 5.7 across all testing sets. Furthermore, our models exhibit significantly enhanced robustness against two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sonqt/robust_qa
pytorchOfficial

Videos

Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology· underline

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems