ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation

Hyeong Kyu Choi; Sharon Li

arXiv:2601.02535·cs.CL·April 10, 2026

ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation

Hyeong Kyu Choi, Sharon Li

PDF

1 Repo

TL;DR

ModeX introduces an evaluator-free, semantic consensus-based method for selecting high-quality outputs from multiple generations of large language models, improving robustness and efficiency in open-ended tasks.

Contribution

It generalizes majority voting to open-ended text by identifying the modal semantic output through spectral clustering, without external evaluators or models.

Findings

01

ModeX outperforms standard baselines in summarization, code generation, and reasoning.

02

ModeX-Lite offers an efficient variant with early pruning.

03

The approach is computationally efficient and improves robustness in open-ended generation.

Abstract

Selecting a single high-quality output from multiple stochastic generations remains a fundamental challenge for large language models (LLMs), particularly in open-ended tasks where no canonical answer exists. While Best-of-N and self-consistency methods show that aggregating multiple generations can improve performance, existing approaches typically rely on external evaluators, reward models, or exact string-match voting, limiting their applicability and efficiency. We propose Mode Extraction (ModeX), an evaluator-free Best-of-N selection framework that generalizes majority voting to open-ended text generation by identifying the modal output representing the dominant semantic consensus among generated texts. ModeX constructs a similarity graph over candidate generations and recursively applies spectral clustering to select a representative centroid, without requiring additional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deeplearning-wisc/ModeX
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.