Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

Zhen Zhang; Changyi Yang; Zijie Xia; Zhen Yang; Chengzhi Liu; Zhaotiao Weng; Yepeng Liu; Haobo Chen; Jin Pan; Chenyang Zhao; Yuheng Bu; Alkesh Patel; Zhe Gan; Xin Eric Wang

arXiv:2604.27039·cs.CL·May 1, 2026

Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

Zhen Zhang, Changyi Yang, Zijie Xia, Zhen Yang, Chengzhi Liu, Zhaotiao Weng, Yepeng Liu, Haobo Chen, Jin Pan, Chenyang Zhao, Yuheng Bu, Alkesh Patel, Zhe Gan, Xin Eric Wang

PDF

1 Repo

TL;DR

The paper introduces LenVM, a token-level length modeling framework for autoregressive models, improving length prediction accuracy, inference efficiency, and interpretability across various tasks.

Contribution

LenVM formulates length modeling as a value estimation problem, providing a scalable, annotation-free, and dense supervision signal for token-level length prediction.

Findings

01

LenVM significantly improves length matching scores on LIFEBench.

02

It maintains high accuracy in length prediction under constrained token budgets.

03

LenVM offers interpretable insights into generation dynamics.

Abstract

Token serves as the fundamental unit of computation in modern autoregressive models, and generation length directly influences both inference cost and reasoning performance. Despite its importance, existing approaches lack fine-grained length modeling, operating primarily at the coarse-grained sequence level. We introduce the Length Value Model (LenVM), a token-level framework that models the remaining generation length. By formulating length modeling as a value estimation problem and assigning a constant negative reward to each generated token, LenVM predicts a bounded, discounted return that serves as a monotone proxy for the remaining generation horizon. This formulation yields supervision that is annotation-free, dense, unbiased, and scalable. Experiments on LLMs and VLMs demonstrate LenVM provides a highly effective signal at inference time. On the LIFEBench exact length matching…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eric-ai-lab/Length-Value-Model
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.