Advancing Sequential Numerical Prediction in Autoregressive Models

Xiang Fei; Jinghui Lu; Qi Sun; Hao Feng; Yanjie Wang; Wei Shi; An-Lan Wang; Jingqun Tang; Can Huang

arXiv:2505.13077·cs.CL·May 29, 2025

Advancing Sequential Numerical Prediction in Autoregressive Models

Xiang Fei, Jinghui Lu, Qi Sun, Hao Feng, Yanjie Wang, Wei Shi, An-Lan Wang, Jingqun Tang, Can Huang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Numerical Token Integrity Loss (NTIL), a novel method that enhances autoregressive models' ability to generate coherent numerical sequences by preserving ordinal relationships and sequence integrity.

Contribution

The paper proposes NTIL, a dual-level loss function that improves numerical sequence prediction in autoregressive models, addressing limitations of standard token-based approaches.

Findings

01

NTIL significantly improves numerical prediction accuracy.

02

NTIL effectively preserves ordinal relationships in sequences.

03

NTIL integrates well with large language models.

Abstract

Autoregressive models have become the de facto choice for sequence generation tasks, but standard approaches treat digits as independent tokens and apply cross-entropy loss, overlooking the coherent structure of numerical sequences. This paper introduces Numerical Token Integrity Loss (NTIL) to address this gap. NTIL operates at two levels: (1) token-level, where it extends the Earth Mover's Distance (EMD) to preserve ordinal relationships between numerical values, and (2) sequence-level, where it penalizes the overall discrepancy between the predicted and actual sequences. This dual approach improves numerical prediction and integrates effectively with LLMs/MLLMs. Extensive experiments show significant performance improvements with NTIL.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xfey/ntil
pytorchOfficial

Videos

Advancing Sequential Numerical Prediction in Autoregressive Models· underline

Taxonomy

TopicsNeural Networks and Applications