Semantic Step Prediction: Multi-Step Latent Forecasting in LLM Reasoning Trajectories via Step Sampling

Yidi Yuan

arXiv:2604.18464·cs.LG·April 21, 2026

Semantic Step Prediction: Multi-Step Latent Forecasting in LLM Reasoning Trajectories via Step Sampling

Yidi Yuan

PDF

TL;DR

This paper introduces a semantic step sampling method for LLM reasoning trajectories that significantly improves multi-step latent prediction accuracy by focusing on semantic boundaries, revealing a tradeoff between generation quality and geometric regularity.

Contribution

It demonstrates that sampling at semantic reasoning step boundaries enhances geometric regularization and multi-step prediction accuracy in LLMs, surpassing random token sampling methods.

Findings

01

168x more accurate multi-step latent prediction with semantic boundary sampling

02

Trajectory shapes are smooth curves, not straight lines, improving predictability

03

Removing language modeling loss increases trajectory predictability, indicating a tradeoff

Abstract

Semantic Tube Prediction (STP) leverages representation geometric to regularize LLM hidden-state trajectories toward locally linear geodesics during fine-tuning, thereby greatly improving data efficiency. The original STP recipe samples random token sub-spans, which is compatible with the base large language model (LLM) training architecture. Inspired by STP, we are interested to investigate whether the sampling position can further enhance the semantic structure of multi-step reasoning, and hence affect its geometric impact. We applied STP at consecutive semantic reasoning step boundaries and achieved 168x more accurate multi-step latent prediction than frozen baselines on ProcessBench (3,400 samples), compared to only 4x for the random-token STP. Probing the latent manifold with a learned non-linear predictor reveals that STP-shaped trajectories are smooth curves, not straight lines:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.