Diverse, not Short: A Length-Controlled Data Selection Strategy for Improving Response Diversity of Language Models

Vijeta Deshpande; Debasmita Ghose; John D. Patterson; Roger Beaty; Anna Rumshisky

arXiv:2505.16245·cs.CL·February 2, 2026

Diverse, not Short: A Length-Controlled Data Selection Strategy for Improving Response Diversity of Language Models

Vijeta Deshpande, Debasmita Ghose, John D. Patterson, Roger Beaty, Anna Rumshisky

PDF

Open Access 1 Datasets 1 Video

TL;DR

This paper introduces Diverse-NS, a length-controlled data selection strategy that enhances response diversity in language models without sacrificing output length, leading to more expressive and creative responses.

Contribution

The paper presents a novel length-controlled data filtering method that improves response diversity in language models, addressing length bias in existing diversity metrics and reward models.

Findings

01

Diverse-NS significantly increases lexical and semantic diversity in responses.

02

The method achieves these improvements with only 3,000 preference pairs.

03

Smaller models like Olmo-2-7B can effectively teach larger models diversity.

Abstract

Diverse language model responses are crucial for creative generation, open-ended tasks, and self-improvement training. We show that common diversity metrics, and even reward models used for preference optimization, systematically bias models toward shorter outputs, limiting expressiveness. To address this, we introduce Diverse, not Short (Diverse-NS), a length-controlled data selection strategy that improves response diversity while maintaining length parity. By generating and filtering preference data that balances diversity, quality, and length, Diverse-NS enables effective training using only 3,000 preference pairs. Applied to LLaMA-3.1-8B and the Olmo-2 family, Diverse-NS substantially enhances lexical and semantic diversity. We show consistent improvement in diversity with minor reduction or gains in response quality on four creative generation tasks: Divergent Associations,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

text-machine-lab/diverse-not-short
dataset· 9 dl
9 dl

Videos

Diverse, not Short: A Length-Controlled Data Selection Strategy for Improving Response Diversity of Language Models· underline

Taxonomy

TopicsPersona Design and Applications · Topic Modeling · Machine Learning in Healthcare

MethodsSelf-Learning