SeLaR: Selective Latent Reasoning in Large Language Models

Renyu Fu; Guibo Luo

arXiv:2604.08299·cs.CL·April 21, 2026

SeLaR: Selective Latent Reasoning in Large Language Models

Renyu Fu, Guibo Luo

PDF

TL;DR

SeLaR enhances reasoning in large language models by selectively activating soft embeddings during low-confidence steps and encouraging diverse reasoning paths, leading to improved performance without additional training.

Contribution

SeLaR introduces an entropy-gated mechanism and contrastive regularization to improve latent reasoning in large language models without training.

Findings

01

SeLaR outperforms standard CoT on five reasoning benchmarks.

02

SeLaR maintains reasoning stability by activating soft embeddings only at low-confidence steps.

03

SeLaR encourages exploration of multiple reasoning trajectories, improving accuracy.

Abstract

Chain-of-Thought (CoT) has become a cornerstone of reasoning in large language models, yet its effectiveness is constrained by the limited expressiveness of discrete token sampling. Recent latent reasoning approaches attempt to alleviate this limitation by replacing discrete tokens with soft embeddings (probability-weighted mixtures of token embeddings) or hidden states, but they commonly suffer from two issues: (1) global activation injects perturbations into high-confidence steps, impairing reasoning stability; and (2) soft embeddings quickly collapse toward the highest-probability token, limiting exploration of alternative trajectories. To address these challenges, we propose SeLaR (Selective Latent Reasoning), a lightweight and training-free framework. SeLaR introduces an entropy-gated mechanism that activates soft embeddings only at low-confidence steps, while preserving discrete…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.