A Survey for Efficient Open Domain Question Answering

Qin Zhang; Shangsi Chen; Dongkuan Xu; Qingqing Cao; Xiaojun Chen,; Trevor Cohn; Meng Fang

arXiv:2211.07886·cs.CL·November 16, 2022·1 cites

A Survey for Efficient Open Domain Question Answering

Qin Zhang, Shangsi Chen, Dongkuan Xu, Qingqing Cao, Xiaojun Chen,, Trevor Cohn, Meng Fang

PDF

Open Access

TL;DR

This survey reviews recent advances in open domain question answering, focusing on balancing accuracy, memory use, and speed to enable practical deployment of ODQA systems.

Contribution

It provides a comprehensive overview of efficiency techniques in ODQA models, including quantitative analysis and identification of open challenges.

Findings

01

Efficiency techniques improve ODQA deployment feasibility

02

Trade-offs between accuracy, memory, and speed are analyzed

03

Open challenges in ODQA efficiency are identified

Abstract

Open domain question answering (ODQA) is a longstanding task aimed at answering factual questions from a large knowledge corpus without any explicit evidence in natural language processing (NLP). Recent works have predominantly focused on improving the answering accuracy and achieved promising progress. However, higher accuracy often comes with more memory consumption and inference latency, which might not necessarily be efficient enough for direct deployment in the real world. Thus, a trade-off between accuracy, memory consumption and processing speed is pursued. In this paper, we provide a survey of recent advances in the efficiency of ODQA models. We walk through the ODQA models and conclude the core techniques on efficiency. Quantitative analysis on memory cost, processing speed, accuracy and overall comparison are given. We hope that this work would keep interested scholars…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings