Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Zhengkai Lin, Zhihang Fu, Kai Liu, Liang Xie, Binbin Lin, Wenxiao, Wang, Deng Cai, Yue Wu, Jieping Ye

TL;DR
This paper investigates the reversal curse in large language models, revealing how their ability to generalize facts depends on context structure and inherent biases, with implications for improving learning methods.
Contribution
The study uncovers the structural and bias-related factors affecting LLMs' fact generalization and proposes insights for enhancing their learning mechanisms.
Findings
LLMs generalize 'B is A' when both A and B are contextually presented.
Generalization depends on the structure of the training data, e.g., biographies.
Inherent biases in fact recall influence downstream performance and are hard to mitigate.
Abstract
While large language models (LLMs) showcase unprecedented capabilities, they also exhibit certain inherent limitations when facing seemingly trivial tasks. A prime example is the recently debated "reversal curse", which surfaces when models, having been trained on the fact "A is B", struggle to generalize this knowledge to infer that "B is A". In this paper, we examine the manifestation of the reversal curse across various tasks and delve into both the generalization abilities and the problem-solving mechanisms of LLMs. This investigation leads to a series of significant insights: (1) LLMs are able to generalize to "B is A" when both A and B are presented in the context as in the case of a multiple-choice question. (2) This generalization ability is highly correlated to the structure of the fact "A is B" in the training documents. For example, this generalization only applies to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
