Efficient Post-Training Refinement of Latent Reasoning in Large Language Models

Xinyuan Wang; Dongjie Wang; Wangyang Ying; Haoyue Bai; Nanxu Gong; Sixun Dong; Kunpeng Liu; Yanjie Fu

arXiv:2506.08552·cs.CL·November 18, 2025

Efficient Post-Training Refinement of Latent Reasoning in Large Language Models

Xinyuan Wang, Dongjie Wang, Wangyang Ying, Haoyue Bai, Nanxu Gong, Sixun Dong, Kunpeng Liu, Yanjie Fu

PDF

Open Access

TL;DR

This paper introduces a lightweight post-training method to refine latent reasoning in large language models, improving reasoning accuracy without extra training by using contrastive feedback and residual embedding updates.

Contribution

It proposes a novel post-training framework with two strategies for refining latent reasoning trajectories, addressing limitations of existing latent reasoning methods.

Findings

01

Achieves a 5% accuracy improvement on MathQA

02

Effective across five reasoning benchmarks

03

Enhances reasoning without additional training

Abstract

Reasoning is a key component of language understanding in Large Language Models. While Chain-of-Thought prompting enhances performance via explicit intermediate steps, it suffers from sufficient token overhead and a fixed reasoning trajectory, preventing step-wise refinement. Recent advances in latent reasoning address these limitations by refining internal reasoning processes directly in the model's latent space, without producing explicit outputs. However, a key challenge remains: how to effectively update reasoning embeddings during post-training to guide the model toward more accurate solutions. To overcome this challenge, we propose a lightweight post-training framework that refines latent reasoning trajectories using two novel strategies: 1) Contrastive reasoning feedback, which compares reasoning embeddings against strong and weak baselines to infer effective update directions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Multimodal Machine Learning Applications