Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology
Son Quoc Tran, Matt Kretchmar

TL;DR
This paper introduces a new training approach for extractive question answering models that enhances robustness to distribution shifts and adversarial attacks without sacrificing in-domain accuracy.
Contribution
It proposes a novel loss function and challenges an implicit assumption in existing datasets, leading to more robust EQA models with improved out-of-domain and adversarial performance.
Findings
Overall F1 score improved by 5.7 points
Enhanced robustness against adversarial attacks
Maintains in-domain performance
Abstract
This paper proposes a novel training method to improve the robustness of Extractive Question Answering (EQA) models. Previous research has shown that existing models, when trained on EQA datasets that include unanswerable questions, demonstrate a significant lack of robustness against distribution shifts and adversarial attacks. Despite this, the inclusion of unanswerable questions in EQA training datasets is essential for ensuring real-world reliability. Our proposed training method includes a novel loss function for the EQA problem and challenges an implicit assumption present in numerous EQA datasets. Models trained with our method maintain in-domain performance while achieving a notable improvement on out-of-domain datasets. This results in an overall F1 score improvement of 5.7 across all testing sets. Furthermore, our models exhibit significantly enhanced robustness against two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Expert finding and Q&A systems
