Open Domain Question Answering with A Unified Knowledge Interface

Kaixin Ma; Hao Cheng; Xiaodong Liu; Eric Nyberg; Jianfeng Gao

arXiv:2110.08417·cs.CL·March 22, 2022

Open Domain Question Answering with A Unified Knowledge Interface

Kaixin Ma, Hao Cheng, Xiaodong Liu, Eric Nyberg, Jianfeng Gao

PDF

Open Access 1 Repo

TL;DR

This paper introduces UDT-QA, a unified framework that uses verbalized structured knowledge from tables and graphs to improve open-domain question answering, achieving state-of-the-art results.

Contribution

It proposes a novel verbalizer-retriever-reader framework that encodes structured knowledge as text for ODQA, bridging the gap between data and text sources.

Findings

01

UDT-QA outperforms text-only baselines on Natural Questions.

02

Verbalized knowledge enhances answer reasoning.

03

The approach sets a new single-model state-of-the-art.

Abstract

The retriever-reader framework is popular for open-domain question answering (ODQA) due to its ability to use explicit knowledge. Although prior work has sought to increase the knowledge coverage by incorporating structured knowledge beyond text, accessing heterogeneous knowledge sources through a unified interface remains an open question. While data-to-text generation has the potential to serve as a universal interface for data and text, its feasibility for downstream tasks remains largely unknown. In this work, we bridge this gap and use the data-to-text method as a means for encoding structured knowledge for ODQA. Specifically, we propose a verbalizer-retriever-reader framework for ODQA over data and text where verbalized tables from Wikipedia and graphs from Wikidata are used as augmented knowledge sources. We show that our Unified Data and Text QA, UDT-QA, can effectively benefit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mayer123/udt-qa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications