Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium
Akbar Anbar Jafari, Gholamreza Anbarjafari

TL;DR
This paper introduces Equilibrium Transformers, a novel autoregressive model that iteratively refines latent representations to address the limitations of open-loop transformers, improving long-range reasoning and factual consistency.
Contribution
The paper proposes the closed-loop prediction principle and Equilibrium Transformers, integrating iterative latent refinement with theoretical guarantees and demonstrating improved performance on complex tasks.
Findings
+3.28% improvement on binary parity task
Gains up to +8.07% on difficult sequences
Unified framework for equilibrium models and diffusion language models
Abstract
Contemporary autoregressive transformers operate in open loop: each hidden state is computed in a single forward pass and never revised, causing errors to propagate uncorrected through the sequence. We identify this open-loop bottleneck as a fundamental architectural limitation underlying well-documented failures in long-range reasoning, factual consistency, and multi-step planning. To address this limitation, we introduce the closed-loop prediction principle, which requires that models iteratively refine latent representations until reaching a self-consistent equilibrium before committing to each token. We instantiate this principle as Equilibrium Transformers (EqT), which augment standard transformer layers with an Equilibrium Refinement Module that minimizes a learned energy function via gradient descent in latent space. The energy function enforces bidirectional prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Domain Adaptation and Few-Shot Learning
