Form Follows Function: Recursive Stem Model

Navid Hakimi

arXiv:2603.15641·cs.AI·March 18, 2026

Form Follows Function: Recursive Stem Model

Navid Hakimi

PDF

Open Access

TL;DR

The paper introduces Recursive Stem Model (RSM), a training method for recursive reasoning networks that achieves faster training, better accuracy, and scalable inference, enabling iterative problem-solving with reliability signals.

Contribution

RSM presents a novel training approach that detaches hidden states, grows recursion depth independently, and uses stochastic depth, improving efficiency and scalability of recursive reasoning models.

Findings

01

>20x faster training than TRM with improved accuracy

02

Achieves 97.5% accuracy on Sudoku-Extreme

03

Reaches 80% accuracy on Maze-Hard in 40 minutes

Abstract

Recursive reasoning models such as Hierarchical Reasoning Model (HRM) and Tiny Recursive Model (TRM) show that small, weight-shared networks can solve compute-heavy and NP puzzles by iteratively refining latent states, but their training typically relies on deep supervision and/or long unrolls that increase wall-clock cost and can bias the model toward greedy intermediate behavior. We introduce Recursive Stem Model (RSM), a recursive reasoning approach that keeps the TRM-style backbone while changing the training contract so the network learns a stable, depth-agnostic transition operator. RSM fully detaches the hidden-state history during training, treats early iterations as detached "warm-up" steps, and applies loss only at the final step. We further grow the outer recursion depth $H$ and inner compute depth $L$ independently and use a stochastic outer-transition scheme (stochastic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Machine Learning in Materials Science