Text-to-Distribution Prediction with Quantile Tokens and Neighbor Context

Yilun Zhu; Yuan Zhuang; Nikhita Vedula; Dushyanta Dhyani; Shaoyuan Xu; Moyan Li; Mohsen Bayati; Bryan Wang; Shervin Malmasi

arXiv:2604.20216·cs.CL·April 23, 2026

Text-to-Distribution Prediction with Quantile Tokens and Neighbor Context

Yilun Zhu, Yuan Zhuang, Nikhita Vedula, Dushyanta Dhyani, Shaoyuan Xu, Moyan Li, Mohsen Bayati, Bryan Wang, Shervin Malmasi

PDF

TL;DR

This paper introduces Quantile Token Regression, a novel method for distributional regression with LLMs, using dedicated quantile tokens and neighbor retrieval to improve distribution predictions.

Contribution

It is the first to insert dedicated quantile tokens into input sequences and incorporate neighbor information, enhancing local grounding and prediction accuracy.

Findings

01

Quantile tokens with neighbors outperform baselines by ~4 points in MAPE.

02

The approach produces 2x narrower and more accurate prediction intervals.

03

Large gains are observed on smaller, challenging datasets.

Abstract

Many applications of LLM-based text regression require predicting a full conditional distribution rather than a single point value. We study distributional regression under empirical-quantile supervision, where each input is paired with multiple observed quantile outcomes, and the target distribution is represented by a dense grid of quantiles. We address two key limitations of current approaches: the lack of local grounding for distribution estimates, and the reliance on shared representations that create an indirect bottleneck between inputs and quantile outputs. In this paper, we introduce Quantile Token Regression, which, to our knowledge, is the first work to insert dedicated quantile tokens into the input sequence, enabling direct input-output pathways for each quantile through self-attention. We further augment these quantile tokens with retrieval, incorporating semantically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.