TL;DR
ModeX introduces an evaluator-free, semantic consensus-based method for selecting high-quality outputs from multiple generations of large language models, improving robustness and efficiency in open-ended tasks.
Contribution
It generalizes majority voting to open-ended text by identifying the modal semantic output through spectral clustering, without external evaluators or models.
Findings
ModeX outperforms standard baselines in summarization, code generation, and reasoning.
ModeX-Lite offers an efficient variant with early pruning.
The approach is computationally efficient and improves robustness in open-ended generation.
Abstract
Selecting a single high-quality output from multiple stochastic generations remains a fundamental challenge for large language models (LLMs), particularly in open-ended tasks where no canonical answer exists. While Best-of-N and self-consistency methods show that aggregating multiple generations can improve performance, existing approaches typically rely on external evaluators, reward models, or exact string-match voting, limiting their applicability and efficiency. We propose Mode Extraction (ModeX), an evaluator-free Best-of-N selection framework that generalizes majority voting to open-ended text generation by identifying the modal output representing the dominant semantic consensus among generated texts. ModeX constructs a similarity graph over candidate generations and recursively applies spectral clustering to select a representative centroid, without requiring additional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
