Large Language Models are Limited in Out-of-Context Knowledge Reasoning

Peng Hu; Changjiang Gao; Ruiqi Gao; Jiajun Chen; and Shujian Huang

arXiv:2406.07393·cs.CL·September 30, 2024

TL;DR

This paper systematically evaluates the out-of-context knowledge reasoning abilities of large language models using a synthetic dataset, revealing their limitations in combining multiple knowledge sources and transferring knowledge across languages.

Contribution

The paper introduces a synthetic dataset with seven OCKR tasks to assess LLMs' out-of-context reasoning, highlighting their limited capabilities and challenges in knowledge retrieval and cross-lingual transfer.

Findings

01. LLMs show limited out-of-context knowledge reasoning (OCKR) ability.

02. Training with reasoning examples does not significantly improve OCKR.

03. Explicit knowledge retrieval training aids the retrieval of attribute knowledge but not of relation knowledge.

Abstract

Large Language Models (LLMs) possess extensive knowledge and strong capabilities in performing in-context reasoning. However, previous work challenges their out-of-context reasoning ability, i.e., the ability to infer information from their training data rather than from the context or prompt. This paper focuses on a significant aspect of out-of-context reasoning: Out-of-Context Knowledge Reasoning (OCKR), which is the ability to combine multiple pieces of knowledge to infer new knowledge. We designed a synthetic dataset with seven representative OCKR tasks to systematically assess the OCKR capabilities of LLMs. Using this dataset, we evaluated several LLMs and discovered that their proficiency in this aspect is limited, regardless of whether the knowledge is trained in separate or adjacent training settings. Moreover, training the model to reason with reasoning examples does not result in significant…
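To make the idea of combining multiple pieces of knowledge concrete, here is a minimal Python sketch of one possible OCKR-style item. It is an illustration only and not the paper's dataset format: the `Fact` class, the `compose_two_hop` helper, and the entity names are all hypothetical. It assumes a relation fact and an attribute fact would each appear as a separate training statement, while the question is asked without either fact in the prompt.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """A single piece of knowledge: (subject, predicate, object). Hypothetical format."""
    subject: str
    predicate: str
    obj: str

    def as_statement(self) -> str:
        # Render the fact as a plain sentence, as it might appear in training text.
        return f"The {self.predicate} of {self.subject} is {self.obj}."


def compose_two_hop(relation_fact: Fact, attribute_fact: Fact) -> tuple[str, str]:
    """Chain a relation fact (A -> B) with an attribute fact (B -> value).

    Returns a question about A whose answer can only be derived by
    combining both facts, plus the expected answer.
    """
    if relation_fact.obj != attribute_fact.subject:
        raise ValueError("The two facts do not chain through a shared entity.")
    question = (
        f"What is the {attribute_fact.predicate} of the "
        f"{relation_fact.predicate} of {relation_fact.subject}?"
    )
    return question, attribute_fact.obj


# Made-up entities, so the answer cannot come from real-world knowledge.
relation = Fact("Alira Venn", "mentor", "Doran Keth")      # relation knowledge
attribute = Fact("Doran Keth", "birth year", "1962")       # attribute knowledge

training_statements = [relation.as_statement(), attribute.as_statement()]
question, answer = compose_two_hop(relation, attribute)

print(training_statements)
print(question)
print(answer)
```

In this sketch, a model would be trained on the two statements separately and then asked the composed question with an empty context. Answering it correctly requires retrieving both facts from the model's parameters and chaining them, which is the kind of ability the abstract reports as limited.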

Code & Models

Repositories

njunlp/id-ockr (Official)

Taxonomy

Topics: Topic Modeling · Semantic Web and Ontologies · Natural Language Processing Techniques