Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics

Yuanhao Ding; Meimingwei Li; Esteban Garces Arias; Matthias A{\ss}enmacher; Christian Heumann; Chongsheng Zhang

arXiv:2604.11012·cs.AI·April 14, 2026

Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics

Yuanhao Ding, Meimingwei Li, Esteban Garces Arias, Matthias A{\ss}enmacher, Christian Heumann, Chongsheng Zhang

PDF

1 Repo

TL;DR

Min-$k$ Sampling introduces a dynamic truncation method analyzing local logit distribution shapes to improve text generation quality and robustness across various tasks, independent of temperature sensitivity.

Contribution

It proposes Min-$k$ Sampling, a novel temperature-invariant truncation strategy that adapts to local confidence structures, outperforming existing methods.

Findings

01

Min-$k$ achieves strict temperature invariance.

02

It improves text quality across reasoning and creative tasks.

03

It maintains robustness even at extreme temperature settings.

Abstract

The quality of text generated by large language models depends critically on the decoding sampling strategy. While mainstream methods such as Top- $k$ , Top- $p$ , and Min- $p$ achieve a balance between diversity and accuracy through probability-space truncation, they share an inherent limitation: extreme sensitivity to the temperature parameter. Recent logit-space approaches like Top- $nσ$ achieve temperature invariance but rely on global statistics that are susceptible to long-tail noise, failing to capture fine-grained confidence structures among top candidates. We propose \textbf{Min- $k$ Sampling}, a novel dynamic truncation strategy that analyzes the local shape of the sorted logit distribution to identify "semantic cliffs": sharp transitions from high-confidence core tokens to uncertain long-tail tokens. By computing a position-weighted relative decay rate, Min- $k$ dynamically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.