SimpleStrat: Diversifying Language Model Generation with Stratification

Justin Wong; Yury Orlovskiy; Michael Luo; Sanjit A. Seshia; Joseph E.; Gonzalez

arXiv:2410.09038·cs.CL·October 15, 2024

SimpleStrat: Diversifying Language Model Generation with Stratification

Justin Wong, Yury Orlovskiy, Michael Luo, Sanjit A. Seshia, Joseph E., Gonzalez

PDF

Open Access 1 Video

TL;DR

SimpleStrat is a novel method that enhances diversity in language model outputs by stratifying the response space, outperforming temperature-based methods in quality and diversity metrics.

Contribution

We introduce SimpleStrat, a stratification-based approach for diversifying language model responses, and propose CoverageQA to evaluate diversity effectively.

Findings

01

SimpleStrat achieves higher recall than GPT-4o.

02

SimpleStrat reduces KL Divergence by 0.36 on average.

03

Temperature increase lowers response quality, contrary to prior belief.

Abstract

Generating diverse responses from large language models (LLMs) is crucial for applications such as planning/search and synthetic data generation, where diversity provides distinct answers across generations. Prior approaches rely on increasing temperature to increase diversity. However, contrary to popular belief, we show not only does this approach produce lower quality individual generations as temperature increases, but it depends on model's next-token probabilities being similar to the true distribution of answers. We propose SimpleStrat, an alternative approach that uses the language model itself to partition the space into strata. At inference, a random stratum is selected and a sample drawn from within the strata. To measure diversity, we introduce CoverageQA, a dataset of underspecified questions with multiple equally plausible answers, and assess diversity by measuring KL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SimpleStrat: Diversifying Language Model Generation with Stratification· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsLLaMA