Unsupervised Open-Domain Question Answering

Pengfei Zhu; Xiaoguang Li; Jian Li; Hai Zhao

arXiv:2108.13817·cs.CL·September 1, 2021

TL;DR

This paper introduces the first approach to unsupervised open-domain question answering, proposing data construction methods that enable the task to reach up to 86% of supervised performance.

Contribution

The paper pioneers the task of unsupervised ODQA and proposes key data construction techniques to support it.

Findings

1. Unsupervised ODQA can achieve up to 86% of supervised performance.
2. Proposed data construction methods are effective for unsupervised ODQA.
3. First formal introduction of unsupervised ODQA in the literature.

Abstract

Open-domain Question Answering (ODQA) has achieved significant results under supervised learning. However, data annotation is hard to sustain given the huge demand of an open domain. Although unsupervised QA and unsupervised Machine Reading Comprehension (MRC) have been explored to some extent, unsupervised ODQA has not, to the best of our knowledge, been addressed. This paper thus pioneers work on unsupervised ODQA by formally introducing the task and proposing a series of key data construction methods. Encouragingly, our exploration shows that unsupervised ODQA can reach up to 86% of the performance of supervised models.

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications