Reversal Invariance in Autoregressive Language Models

Mihir Sahasrabudhe

arXiv:2511.00341·cs.CL·November 4, 2025

Reversal Invariance in Autoregressive Language Models

Mihir Sahasrabudhe

PDF

Open Access

TL;DR

This paper introduces the concept of reversal invariance in autoregressive language models, showing that they are inherently symmetric to text reversal, which may limit their ability to model directional language features.

Contribution

It formalizes reversal invariance as a structural property of CLM, analyzes its implications, and proposes the need for asymmetric objectives to better capture language directionality.

Findings

01

Models trained on reversed text perform comparably to forward-trained models.

02

Reversal invariance explains why standard CLM is direction-blind.

03

Current objectives may fail to encode directional dependencies in language.

Abstract

We formalize a structural property of the causal (autoregressive) language modeling (CLM) objective: reversal invariance. Formally, the next-token prediction loss assigns identical likelihood to a corpus and its reversal, implying that standard CLM pretraining is direction-blind. This symmetry explains why models trained on reversed text can achieve comparable performance to those trained on forward text, despite the inherently time-asymmetric nature of human language and reasoning. We argue that this invariance represents a limitation of current pretraining objectives rather than a benign artifact. If natural language encodes directional dependencies - phonological, morphological, or causal - a symmetric objective may fail to capture them. We therefore propose viewing pretraining through the lens of temporal asymmetry, motivating future work on loss functions and architectures that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Multimodal Machine Learning Applications