Optimizing Diversity and Quality through Base-Aligned Model Collaboration

Yichen Wang; Chenghao Yang; Tenghao Huang; Muhao Chen; Jonathan May; Mina Lee

arXiv:2511.05650·cs.CL·November 11, 2025

Optimizing Diversity and Quality through Base-Aligned Model Collaboration

Yichen Wang, Chenghao Yang, Tenghao Huang, Muhao Chen, Jonathan May, Mina Lee

PDF

Open Access

TL;DR

This paper introduces BACo, a novel inference-time model collaboration framework that dynamically combines aligned language models to enhance both diversity and quality of generated outputs, outperforming existing methods.

Contribution

We propose BACo, a new token-level model collaboration method that improves diversity and quality simultaneously at inference time, with strong controllability and state-of-the-art results.

Findings

01

BACo achieves a 21.3% joint improvement in diversity and quality.

02

It surpasses state-of-the-art inference-time baselines across multiple tasks and metrics.

03

Human evaluations confirm the effectiveness of BACo in real-world scenarios.

Abstract

Alignment has greatly improved large language models (LLMs)' output quality at the cost of diversity, yielding highly similar outputs across generations. We propose Base-Aligned Model Collaboration (BACo), an inference-time token-level model collaboration framework that dynamically combines a base LLM with its aligned counterpart to optimize diversity and quality. Inspired by prior work (Fei et al., 2025), BACo employs routing strategies that determine, at each token, from which model to decode based on next-token prediction uncertainty and predicted contents' semantic role. Prior diversity-promoting methods, such as retraining, prompt engineering, and multi-sampling methods, improve diversity but often degrade quality or require costly decoding or post-training. In contrast, BACo achieves both high diversity and quality post hoc within a single pass, while offering strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning