TL;DR
Attractor Models enhance language modeling and reasoning by iteratively refining representations through fixed-point solving, outperforming traditional Transformers in efficiency and accuracy.
Contribution
This paper introduces Attractor Models that stabilize and improve looped transformers using implicit differentiation and fixed-point convergence, enabling scalable iterative refinement.
Findings
Outperform standard Transformers with up to 46.6% perplexity reduction
Achieve 91.4% accuracy on Sudoku-Extreme with only 27M parameters
Show stable training and inference with adaptive iteration and equilibrium internalization
Abstract
Looped Transformers offer a promising alternative to purely feed-forward computation by iteratively refining latent representations, improving language modeling and reasoning. Yet recurrent architectures remain unstable to train, costly to optimize and deploy, and constrained to small, fixed recurrence depths. We introduce Attractor Models, in which a backbone module first proposes output embeddings, then an attractor module refines them by solving for the fixed point, with gradients obtained through implicit differentiation. Thus, training memory remains constant in effective depth, and iterations are chosen adaptively by convergence. Empirically, Attractor Models outperform existing models across two regimes, large-scale language-model pretraining and reasoning with tiny models. In language modeling, Attractor Models deliver a Pareto improvement over standard Transformers and stable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
