Min-p, Max Exaggeration: A Critical Analysis of Min-p Sampling in Language Models
Rylan Schaeffer, Joshua Kazdan, Yegor Denisov-Blanch

TL;DR
This paper critically re-evaluates the claims of min-p sampling in language models, finding that it does not outperform existing methods in quality or diversity, contrary to prior claims, and questions the validity of community adoption claims.
Contribution
It provides a comprehensive reanalysis of min-p sampling, correcting methodological errors and challenging its purported advantages over baseline sampling methods.
Findings
Min-p does not outperform baselines in quality or diversity.
Original claims were based on flawed statistical and qualitative analyses.
Community adoption claims are unsubstantiated and misleading.
Abstract
Sampling from language models impacts the quality and diversity of outputs, affecting both research and real-world applications. Recently, Nguyen et al. 2024's "Turning Up the Heat: Min-p Sampling for Creative and Coherent LLM Outputs" introduced a new sampler called min-p, claiming it achieves superior quality and diversity over established samplers such as basic, top-k, and top-p sampling. The significance of these claims was underscored by the paper's recognition as the 18th highest-scoring submission to ICLR 2025 and selection for an Oral presentation. This paper conducts a comprehensive re-examination of the evidence supporting min-p and reaches different conclusions from the original paper's four lines of evidence. First, the original paper's human evaluations omitted data, conducted statistical tests incorrectly, and described qualitative feedback inaccurately; our reanalysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education · Topic Modeling
