TL;DR
LoopUS is a post-training framework that transforms pretrained LLMs into looped architectures to enhance reasoning performance efficiently, without retraining from scratch.
Contribution
It introduces a novel latent-refinement architecture with four core components, enabling stable, efficient looping in pretrained models without extensive retraining.
Findings
Improves reasoning performance without additional training.
Stabilizes latent looping against computational and representation issues.
Operates as a post-training conversion, saving computational resources.
Abstract
Looped computation shows promise in improving the reasoning-oriented performance of LLMs by scaling test-time compute. However, existing approaches typically require either training recurrent models from scratch or applying disruptive retrofits, which involve substantial computational costs and may compromise pretrained capabilities. To address these limitations, we introduce \textbf{Looped Depth Up-Scaling} (LoopUS), a post-training framework that converts a standard pretrained LLM into a looped architecture. As a key technical contribution, LoopUS recasts the pretrained LLM into an encoder, a looped reasoning block, and a decoder. It operationalizes this latent-refinement architecture through four core components: (1) block decomposition, guided by staged representation dynamics; (2) an input-dependent selective gate to mitigate hidden-state drift; (3) random deep supervision for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Thrillcrazyer/Qwen3_1.7B_LoopUS_SFTmodel· 51 dl· ♡ 151 dl♡ 1
- 🤗Thrillcrazyer/Phi4_LoopUSmodel· 97 dl· ♡ 197 dl♡ 1
- 🤗Thrillcrazyer/Qwen3-8B_LoopUSmodel· 105 dl· ♡ 1105 dl♡ 1
- 🤗Thrillcrazyer/Qwen3_1.7B_LoopUSmodel· 103 dl· ♡ 2103 dl♡ 2
- 🤗Thrillcrazyer/Qwen3-4B_LoopUSmodel· 107 dl· ♡ 2107 dl♡ 2
- 🤗Thrillcrazyer/TinyLlama_v1.1_LoopUSmodel· 96 dl· ♡ 196 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
