Delving into the Reversal Curse: How Far Can Large Language Models   Generalize?

Zhengkai Lin; Zhihang Fu; Kai Liu; Liang Xie; Binbin Lin; Wenxiao; Wang; Deng Cai; Yue Wu; Jieping Ye

arXiv:2410.18808·cs.CL·November 25, 2024

Delving into the Reversal Curse: How Far Can Large Language Models Generalize?

Zhengkai Lin, Zhihang Fu, Kai Liu, Liang Xie, Binbin Lin, Wenxiao, Wang, Deng Cai, Yue Wu, Jieping Ye

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the reversal curse in large language models, revealing how their ability to generalize facts depends on context structure and inherent biases, with implications for improving learning methods.

Contribution

The study uncovers the structural and bias-related factors affecting LLMs' fact generalization and proposes insights for enhancing their learning mechanisms.

Findings

01

LLMs generalize 'B is A' when both A and B are contextually presented.

02

Generalization depends on the structure of the training data, e.g., biographies.

03

Inherent biases in fact recall influence downstream performance and are hard to mitigate.

Abstract

While large language models (LLMs) showcase unprecedented capabilities, they also exhibit certain inherent limitations when facing seemingly trivial tasks. A prime example is the recently debated "reversal curse", which surfaces when models, having been trained on the fact "A is B", struggle to generalize this knowledge to infer that "B is A". In this paper, we examine the manifestation of the reversal curse across various tasks and delve into both the generalization abilities and the problem-solving mechanisms of LLMs. This investigation leads to a series of significant insights: (1) LLMs are able to generalize to "B is A" when both A and B are presented in the context as in the case of a multiple-choice question. (2) This generalization ability is highly correlated to the structure of the fact "A is B" in the training documents. For example, this generalization only applies to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alibaba/thinking_bias
pytorchOfficial

Videos

Delving into the Reversal Curse: How Far Can Large Language Models Generalize?· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling