Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium

Akbar Anbar Jafari; Gholamreza Anbarjafari

arXiv:2511.21882 · cs.LG · December 1, 2025

TL;DR

This paper introduces Equilibrium Transformers (EqT), autoregressive models that iteratively refine latent representations before committing to each token. The approach targets the limitations of open-loop transformers in long-range reasoning and factual consistency.

Contribution

The paper proposes the closed-loop prediction principle and Equilibrium Transformers, integrating iterative latent refinement with theoretical guarantees and demonstrating improved performance on complex tasks.

Findings

01  +3.28% improvement on the binary parity task

02  Gains of up to +8.07% on difficult sequences

03  Unified framework for equilibrium models and diffusion language models

Abstract

Contemporary autoregressive transformers operate in open loop: each hidden state is computed in a single forward pass and never revised, causing errors to propagate uncorrected through the sequence. We identify this open-loop bottleneck as a fundamental architectural limitation underlying well-documented failures in long-range reasoning, factual consistency, and multi-step planning. To address this limitation, we introduce the closed-loop prediction principle, which requires that models iteratively refine latent representations until reaching a self-consistent equilibrium before committing to each token. We instantiate this principle as Equilibrium Transformers (EqT), which augment standard transformer layers with an Equilibrium Refinement Module that minimizes a learned energy function via gradient descent in latent space. The energy function enforces bidirectional prediction…
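
The refinement step described in the abstract (gradient descent on an energy in latent space, run until the latent stops changing) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the quadratic toy energy stands in for the paper's learned energy function, which this listing does not specify, and the names `make_toy_energy_grad` and `refine_latent`, the step size, and the stopping tolerance are hypothetical.

```python
from typing import Callable, List, Tuple

Vector = List[float]


def make_toy_energy_grad(target: Vector, anchor: Vector, lam: float) -> Callable[[Vector], Vector]:
    """Return the gradient of a toy energy (an assumption, not the paper's learned energy):

        E(z) = 0.5 * ||z - target||^2 + 0.5 * lam * ||z - anchor||^2
    """
    def grad(z: Vector) -> Vector:
        return [(zi - ti) + lam * (zi - ai) for zi, ti, ai in zip(z, target, anchor)]
    return grad


def refine_latent(
    z0: Vector,
    energy_grad: Callable[[Vector], Vector],
    step_size: float = 0.1,
    tol: float = 1e-8,
    max_iters: int = 1000,
) -> Tuple[Vector, int]:
    """Run plain gradient descent on the energy until the gradient norm drops below tol.

    Returns the refined latent and the number of update steps taken.
    """
    z = list(z0)
    for iteration in range(max_iters):
        g = energy_grad(z)
        grad_norm = sum(gi * gi for gi in g) ** 0.5
        if grad_norm < tol:
            # Equilibrium reached: the latent is (approximately) a stationary point.
            return z, iteration
        # One descent step in latent space.
        z = [zi - step_size * gi for zi, gi in zip(z, g)]
    return z, max_iters


if __name__ == "__main__":
    grad = make_toy_energy_grad(target=[1.0, -1.0], anchor=[0.0, 0.0], lam=1.0)
    z_star, steps = refine_latent([0.0, 0.0], grad)
    print(z_star, steps)
```

For this toy energy the minimizer has the closed form (target + lam * anchor) / (1 + lam), so the loop should settle near [0.5, -0.5]. In a model following the closed-loop principle, a token would be emitted only after such a loop has converged; how EqT defines and trains its energy is not covered by this sketch.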

Taxonomy

Topics: Machine Learning in Healthcare · Topic Modeling · Domain Adaptation and Few-Shot Learning