Probabilistic Tiny Recursive Model

Amin Sghaier; Ali Parviz; Alexia Jolicoeur-Martineau

arXiv:2605.19943·cs.AI·May 20, 2026

Probabilistic Tiny Recursive Model

Amin Sghaier, Ali Parviz, Alexia Jolicoeur-Martineau

PDF

TL;DR

Probabilistic TRM introduces stochastic exploration by injecting Gaussian noise during recursive reasoning, significantly improving accuracy on reasoning benchmarks without retraining or task-specific modifications.

Contribution

It proposes a task-agnostic, test-time compute scaling framework that enhances Tiny Recursive Models through stochastic exploration, leading to substantial accuracy improvements.

Findings

01

Achieved 98.75% accuracy on Sudoku-Extreme, up from 87.4%.

02

Improved puzzle-solving accuracy from 62.6% to 91.2% on Pencil Puzzle Bench.

03

Nearly doubled accuracy compared to frontier LLMs at minimal computational cost.

Abstract

Tiny Recursive Models (TRM) solve complex reasoning tasks with a fraction of the parameters of modern large language models (LLMs) by iteratively refining a latent state and final answer. While powerful, their deterministic recursion can lead to convergence at suboptimal solutions, without escape mechanism. A common workaround relies on task-specific input perturbations at test time combined with answer aggregation via voting. We introduce Probabilistic TRM (PTRM), a task-agnostic framework for test-time compute scaling that addresses this limitation through stochastic exploration. PTRM injects Gaussian noise at each deep recursion step, enabling parallel trajectories to explore diverse solution basins, and selects among them using the model's existing Q head (used for early stopping in the original TRM). Without requiring retraining or task-specific augmentations, PTRM enables…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.