DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text

Wenting Zhao; Ye Liu; Tong Niu; Yao Wan; Philip S. Yu; Shafiq Joty; Yingbo Zhou; Semih Yavuz

arXiv:2310.20170·cs.CL·November 1, 2023·1 cites

TL;DR

This paper introduces DIVKNOWQA, a benchmark and method for evaluating and improving LLM reasoning by integrating structured knowledge graphs and unstructured text in open-domain question answering, emphasizing multi-source retrieval and symbolic query generation.

Contribution

The paper presents a new dataset and approach that combine structured and unstructured knowledge sources, along with a retrieval method that enhances LLM reasoning capabilities.

Findings

01 Model outperforms previous approaches significantly.

02 Effective retrieval from both knowledge base and text.

03 Addresses multi-source and symbolic query challenges.

Abstract

Large Language Models (LLMs) have exhibited impressive generation capabilities, but they suffer from hallucinations when solely relying on their internal knowledge, especially when answering questions that require less commonly known information. Retrieval-augmented LLMs have emerged as a potential solution to ground LLMs in external knowledge. Nonetheless, recent approaches have primarily emphasized retrieval from unstructured text corpora, owing to its seamless integration into prompts. When using structured data such as knowledge graphs, most methods simplify it into natural text, neglecting the underlying structures. Moreover, a significant gap in the current landscape is the absence of a realistic benchmark for evaluating the effectiveness of grounding LLMs on heterogeneous knowledge sources (e.g., knowledge base and text). To fill this gap, we have curated a comprehensive dataset…
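The abstract's contrast between flattening a knowledge graph into natural text and keeping its structure can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy triples, the relation names, and the helper functions `linearize`, `query`, and `two_hop` are hypothetical and are not taken from the paper or its dataset.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]

# Hypothetical toy knowledge base; entities and relation names are illustrative only.
TOY_KB: List[Triple] = [
    ("Ada Lovelace", "collaborated_with", "Charles Babbage"),
    ("Charles Babbage", "designed", "Analytical Engine"),
    ("Ada Lovelace", "born_in", "London"),
]


def linearize(triples: List[Triple]) -> str:
    """Flatten triples into plain sentences, discarding the graph structure."""
    return " ".join(f"{s} {r.replace('_', ' ')} {o}." for s, r, o in triples)


def query(triples: List[Triple], subject: str, relation: str) -> List[str]:
    """Structured lookup: return every object linked to `subject` via `relation`."""
    return [o for s, r, o in triples if s == subject and r == relation]


def two_hop(triples: List[Triple], subject: str, first: str, second: str) -> List[str]:
    """Follow two relations in sequence, which the flattened text cannot do directly."""
    results: List[str] = []
    for intermediate in query(triples, subject, first):
        results.extend(query(triples, intermediate, second))
    return results


if __name__ == "__main__":
    # Text view: convenient to paste into a prompt, but the structure is gone.
    print(linearize(TOY_KB))
    # Structured view: an explicit two-hop traversal over the same facts.
    print(two_hop(TOY_KB, "Ada Lovelace", "collaborated_with", "designed"))
```

The flattened string is easy to place in a prompt, while the structured lookup answers a multi-hop question by explicit traversal; this is only a toy contrast, not the retrieval method the paper proposes.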

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks

Methods: Balanced Selection