Unsupervised Multi-hop Question Answering by Question Generation

Liangming Pan; Wenhu Chen; Wenhan Xiong; Min-Yen Kan; William Yang; Wang

arXiv:2010.12623·cs.CL·April 13, 2021

Unsupervised Multi-hop Question Answering by Question Generation

Liangming Pan, Wenhu Chen, Wenhan Xiong, Min-Yen Kan, William Yang, Wang

PDF

1 Repo

TL;DR

This paper introduces MQA-QG, an unsupervised framework for multi-hop question answering that generates training data from various sources, enabling effective QA models without human-labeled multi-hop questions.

Contribution

The paper presents a novel unsupervised method for generating multi-hop QA training data, reducing reliance on human annotations and improving training efficiency.

Findings

01

Achieves 61% of supervised performance on HybridQA

02

Achieves 83% of supervised performance on HotpotQA

03

Pretraining with generated data reduces need for human annotations

Abstract

Obtaining training data for multi-hop question answering (QA) is time-consuming and resource-intensive. We explore the possibility to train a well-performed multi-hop QA model without referencing any human-labeled multi-hop question-answer pairs, i.e., unsupervised multi-hop QA. We propose MQA-QG, an unsupervised framework that can generate human-like multi-hop training data from both homogeneous and heterogeneous data sources. MQA-QG generates questions by first selecting/generating relevant information from each data source and then integrating the multiple information to form a multi-hop question. Using only generated training data, we can train a competent multi-hop QA which achieves 61% and 83% of the supervised learning performance for the HybridQA and the HotpotQA dataset, respectively. We also show that pretraining the QA system with the generated data would greatly reduce the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

teacherpeterpan/Unsupervised-Multi-hop-QA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.