Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study

Agniv Roy Choudhury; Vignesh Ponselvan Rajasingh

arXiv:2601.02700·cs.CL·January 7, 2026

Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study

Agniv Roy Choudhury, Vignesh Ponselvan Rajasingh

PDF

Open Access

TL;DR

This study systematically analyzes adversarial vulnerabilities in transformer-based QA systems, identifies key failure modes, and demonstrates that targeted mitigation strategies, especially entity-aware contrastive learning, significantly improve robustness without sacrificing accuracy.

Contribution

It introduces a comprehensive multi-level error analysis framework and proposes NER-guided contrastive learning as an effective mitigation strategy for adversarial QA robustness.

Findings

01

Scaling models improves robustness and accuracy.

02

Optimal adversarial fine-tuning ratio is 80% clean, 20% adversarial.

03

Entity-aware contrastive learning reduces the adversarial gap to near parity.

Abstract

Question answering (QA) systems achieve impressive performance on standard benchmarks like SQuAD, but remain vulnerable to adversarial examples. This project investigates the adversarial robustness of transformer models on the AddSent adversarial dataset through systematic experimentation across model scales and targeted mitigation strategies. We perform comprehensive multi-level error analysis using five complementary categorization schemes, identifying negation confusion and entity substitution as the primary failure modes. Through systematic evaluation of adversarial fine-tuning ratios, we identify 80% clean + 20% adversarial data as optimal. Data augmentation experiments reveal a capacity bottleneck in small models. Scaling from ELECTRA-small (14M parameters) to ELECTRA-base (110M parameters) eliminates the robustness-accuracy trade-off, achieving substantial improvements on both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks