Semantic Step Prediction: Multi-Step Latent Forecasting in LLM Reasoning Trajectories via Step Sampling
Yidi Yuan

TL;DR
This paper introduces a semantic step sampling method for LLM reasoning trajectories that significantly improves multi-step latent prediction accuracy by focusing on semantic boundaries, revealing a tradeoff between generation quality and geometric regularity.
Contribution
It demonstrates that sampling at semantic reasoning step boundaries enhances geometric regularization and multi-step prediction accuracy in LLMs, surpassing random token sampling methods.
Findings
168x more accurate multi-step latent prediction with semantic boundary sampling
Trajectory shapes are smooth curves, not straight lines, improving predictability
Removing language modeling loss increases trajectory predictability, indicating a tradeoff
Abstract
Semantic Tube Prediction (STP) leverages representation geometric to regularize LLM hidden-state trajectories toward locally linear geodesics during fine-tuning, thereby greatly improving data efficiency. The original STP recipe samples random token sub-spans, which is compatible with the base large language model (LLM) training architecture. Inspired by STP, we are interested to investigate whether the sampling position can further enhance the semantic structure of multi-step reasoning, and hence affect its geometric impact. We applied STP at consecutive semantic reasoning step boundaries and achieved 168x more accurate multi-step latent prediction than frozen baselines on ProcessBench (3,400 samples), compared to only 4x for the random-token STP. Probing the latent manifold with a learned non-linear predictor reveals that STP-shaped trajectories are smooth curves, not straight lines:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
