Reasoning Factual Knowledge in Structured Data with Large Language   Models

Sirui Huang; Yanggan Gu; Xuming Hu; Zhonghao Li; Qing Li; Guandong Xu

arXiv:2408.12188·cs.CL·August 23, 2024

Reasoning Factual Knowledge in Structured Data with Large Language Models

Sirui Huang, Yanggan Gu, Xuming Hu, Zhonghao Li, Qing Li, Guandong Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces StructFact, a benchmark to evaluate large language models' ability to reason with structured data, revealing current limitations and guiding future improvements in factual knowledge inference.

Contribution

The paper presents StructFact, a comprehensive benchmark for assessing LLMs' reasoning over structured data, highlighting their current shortcomings and fostering advancements in knowledge reasoning.

Findings

01

LLMs show limited ability to infer factual knowledge from structured data.

02

The benchmark covers diverse tasks, domains, and regions for comprehensive evaluation.

03

Current training strategies do not fully address the challenges of reasoning with structured data.

Abstract

Large language models (LLMs) have made remarkable progress in various natural language processing tasks as a benefit of their capability to comprehend and reason with factual knowledge. However, a significant amount of factual knowledge is stored in structured data, which possesses unique characteristics that differ from the unstructured texts used for pretraining. This difference can introduce imperceptible inference parameter deviations, posing challenges for LLMs in effectively utilizing and reasoning with structured data to accurately infer factual knowledge. To this end, we propose a benchmark named StructFact, to evaluate the structural reasoning capabilities of LLMs in inferring factual knowledge. StructFact comprises 8,340 factual questions encompassing various tasks, domains, timelines, and regions. This benchmark allows us to investigate the capability of LLMs across five…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

egangu/structfact
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques

MethodsSparse Evolutionary Training