SeLaR: Selective Latent Reasoning in Large Language Models
Renyu Fu, Guibo Luo

TL;DR
SeLaR enhances reasoning in large language models by selectively activating soft embeddings during low-confidence steps and encouraging diverse reasoning paths, leading to improved performance without additional training.
Contribution
SeLaR introduces an entropy-gated mechanism and contrastive regularization to improve latent reasoning in large language models without training.
Findings
SeLaR outperforms standard CoT on five reasoning benchmarks.
SeLaR maintains reasoning stability by activating soft embeddings only at low-confidence steps.
SeLaR encourages exploration of multiple reasoning trajectories, improving accuracy.
Abstract
Chain-of-Thought (CoT) has become a cornerstone of reasoning in large language models, yet its effectiveness is constrained by the limited expressiveness of discrete token sampling. Recent latent reasoning approaches attempt to alleviate this limitation by replacing discrete tokens with soft embeddings (probability-weighted mixtures of token embeddings) or hidden states, but they commonly suffer from two issues: (1) global activation injects perturbations into high-confidence steps, impairing reasoning stability; and (2) soft embeddings quickly collapse toward the highest-probability token, limiting exploration of alternative trajectories. To address these challenges, we propose SeLaR (Selective Latent Reasoning), a lightweight and training-free framework. SeLaR introduces an entropy-gated mechanism that activates soft embeddings only at low-confidence steps, while preserving discrete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
