Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge

Xutao Ma; Yixiao Huang; Hanlin Zhu; Somayeh Sojoudi

arXiv:2602.02470·cs.AI·February 3, 2026

Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge

Xutao Ma, Yixiao Huang, Hanlin Zhu, Somayeh Sojoudi

PDF

Open Access

TL;DR

This paper demonstrates that the reversal curse in autoregressive language models can be mitigated by a simple data regularization technique called the Identity Bridge, enabling models to learn higher-level rules rather than just memorizing facts.

Contribution

The paper introduces the Identity Bridge data recipe and provides theoretical and empirical evidence that it helps models overcome the reversal curse, a fundamental limitation in LLMs.

Findings

01

A 1B model finetuned with the recipe achieves 40% success on reversal tasks.

02

Without the recipe, models have near-zero success in reversal tasks.

03

Theoretical analysis shows even one-layer transformers can break the reversal curse with this method.

Abstract

Autoregressive large language models (LLMs) have achieved remarkable success in many complex tasks, yet they can still fail in very simple logical reasoning such as the "reversal curse" -- when trained on forward knowledge data of the form " $A \to B$ " (e.g., Alice's husband is Bob), the model is unable to deduce the reversal knowledge " $B \leftarrow A$ " (e.g., Bob's wife is Alice) during test. Extensive prior research suggests that this failure is an inherent, fundamental limit of autoregressive causal LLMs, indicating that these models tend to memorize factual-level knowledge rather than capture higher-level rules. In this paper, we challenge this view by showing that this seemingly fundamental limit can be mitigated by slightly tweaking the training data with a simple regularization data recipe called the Identity Bridge of the form " $A \to A$ " (e.g., The name of Alice is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare and Education