Will this Question be Answered? Question Filtering via Answer Model Distillation for Efficient Question Answering
Siddhant Garg, Alessandro Moschitti

TL;DR
This paper introduces a method to improve QA system efficiency by filtering questions using distilled models that predict answerability, significantly reducing computation with minimal recall loss.
Contribution
The authors propose a novel question filtering approach using Transformer-based models distilled from answer models, enabling efficient QA with minimal accuracy loss.
Findings
Question filtering reduces computation by ~60%.
Distilled models accurately approximate QA confidence scores.
Minimal recall loss (~3-4%) with significant efficiency gains.
Abstract
In this paper we propose a novel approach towards improving the efficiency of Question Answering (QA) systems by filtering out questions that will not be answered by them. This is based on an interesting new finding: the answer confidence scores of state-of-the-art QA systems can be approximated well by models solely using the input question text. This enables preemptive filtering of questions that are not answered by the system due to their answer confidence scores being lower than the system threshold. Specifically, we learn Transformer-based question models by distilling Transformer-based answering models. Our experiments on three popular QA datasets and one industrial QA benchmark demonstrate the ability of our question models to approximate the Precision/Recall curves of the target QA system well. These question models, when used as filters, can effectively trade off lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
