Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems
Ibrahim Alabdulmohsin, Xiaohua Zhai

TL;DR
The paper introduces Recursive INference Scaling (RINS), a recursive inference method that significantly enhances language and multimodal system performance, outperforming existing strategies and improving scaling laws.
Contribution
RINS is a novel recursive inference technique that outperforms prior methods, improves performance across tasks, and enhances data scaling laws in language and multimodal models.
Findings
RINS outperforms +55 variants, including RAO and latent recurrent thinking.
RINS improves 0-shot ImageNet accuracy by +2% in multimodal systems.
RINS enhances asymptotic performance limits and scaling exponents.
Abstract
Inspired by recent findings on the fractal geometry of language, we introduce Recursive INference Scaling (RINS) as a complementary, plug-in recipe for scaling inference time in language and multimodal systems. RINS is a particular form of recursive depth that significantly outperforms +55 other variants, including the recent "repeat-all-over" (RAO) strategy in Mobile LLM (Liu et al., 2024) and latent recurrent thinking (Geiping et al., 2025). Unlike prior works, we carry out our comparisons on a compute-matched regime, and demonstrate that for a fixed model size and training compute budget, RINS substantially improves language modeling performance. It also generalizes beyond pure language tasks, delivering gains in multimodal systems, including a +2% improvement in 0-shot ImageNet accuracy for SigLIP-B/16. Additionally, by deriving data scaling laws, we show that RINS improves both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction
