TL;DR
Min-$k$ Sampling introduces a dynamic truncation method analyzing local logit distribution shapes to improve text generation quality and robustness across various tasks, independent of temperature sensitivity.
Contribution
It proposes Min-$k$ Sampling, a novel temperature-invariant truncation strategy that adapts to local confidence structures, outperforming existing methods.
Findings
Min-$k$ achieves strict temperature invariance.
It improves text quality across reasoning and creative tasks.
It maintains robustness even at extreme temperature settings.
Abstract
The quality of text generated by large language models depends critically on the decoding sampling strategy. While mainstream methods such as Top-, Top-, and Min- achieve a balance between diversity and accuracy through probability-space truncation, they share an inherent limitation: extreme sensitivity to the temperature parameter. Recent logit-space approaches like Top- achieve temperature invariance but rely on global statistics that are susceptible to long-tail noise, failing to capture fine-grained confidence structures among top candidates. We propose \textbf{Min- Sampling}, a novel dynamic truncation strategy that analyzes the local shape of the sorted logit distribution to identify "semantic cliffs": sharp transitions from high-confidence core tokens to uncertain long-tail tokens. By computing a position-weighted relative decay rate, Min- dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
