Contextualized Sparse Representations for Real-Time Open-Domain Question   Answering

Jinhyuk Lee; Minjoon Seo; Hannaneh Hajishirzi; Jaewoo Kang

arXiv:1911.02896·cs.CL·May 4, 2020·1 cites

Contextualized Sparse Representations for Real-Time Open-Domain Question Answering

Jinhyuk Lee, Minjoon Seo, Hannaneh Hajishirzi, Jaewoo Kang

PDF

Open Access 3 Repos

TL;DR

This paper introduces Sparc, a contextualized sparse representation method that enhances phrase embeddings for open-domain question answering, achieving higher accuracy and faster inference than existing models.

Contribution

It proposes a novel sparse vector learning approach using rectified self-attention, improving phrase retrieval accuracy and speed in open-domain QA systems.

Findings

01

Achieves over 4% improvement on CuratedTREC and SQuAD-Open datasets.

02

Outperforms previous retrieve & read models in accuracy and inference speed.

03

Provides a scalable, efficient phrase embedding method for real-time QA.

Abstract

Open-domain question answering can be formulated as a phrase retrieval problem, in which we can expect huge scalability and speed benefit but often suffer from low accuracy due to the limitation of existing phrase representation models. In this paper, we aim to improve the quality of each phrase embedding by augmenting it with a contextualized sparse representation (Sparc). Unlike previous sparse vectors that are term-frequency-based (e.g., tf-idf) or directly learned (only few thousand dimensions), we leverage rectified self-attention to indirectly learn sparse vectors in n-gram vocabulary space. By augmenting the previous phrase retrieval model (Seo et al., 2019) with Sparc, we show 4%+ improvement in CuratedTREC and SQuAD-Open. Our CuratedTREC score is even better than the best known retrieve & read model with at least 45x faster inference speed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings