GTS: Inference-Time Scaling of Latent Reasoning with a Learnable Gaussian Thought Sampler

Minghan Wang; Ye Bai; Thuy-Trang Vu; Ehsan Shareghi; Gholamreza Haffari

arXiv:2602.14077·cs.CL·March 19, 2026

GTS: Inference-Time Scaling of Latent Reasoning with a Learnable Gaussian Thought Sampler

Minghan Wang, Ye Bai, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari

PDF

Open Access

TL;DR

This paper introduces GTS, a learnable Gaussian sampler that improves inference-time scaling in latent reasoning models by replacing heuristic perturbations with explicit, optimized probabilistic sampling, leading to more reliable reasoning.

Contribution

The paper proposes GTS, a novel module that learns conditional distributions for latent exploration, replacing heuristic methods with an explicit, trainable sampling policy.

Findings

01

GTS outperforms heuristic perturbations across multiple benchmarks.

02

Explicit probabilistic sampling improves the reliability of inference-time scaling.

03

GTS demonstrates better control and optimization of latent exploration.

Abstract

Inference-time scaling (ITS) in latent reasoning models typically relies on heuristic perturbations, such as dropout or fixed Gaussian noise, to generate diverse candidate trajectories. However, we show that stronger perturbations do not necessarily yield better sampling quality: they often induce larger distribution shifts without producing more useful reasoning paths or better final decisions. A key limitation is that these perturbations inject stochasticity without defining an explicit conditional sampling distribution, making latent exploration difficult to control or optimize. To address this, we propose the Gaussian Thought Sampler (GTS), a lightweight module that reformulates latent exploration as sampling from a learned conditional distribution over continuous reasoning states. GTS predicts context-dependent perturbation distributions and is trained with GRPO-style policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Machine Learning in Healthcare