Improving Model Alignment Through Collective Intelligence of Open-Source   LLMS

Junlin Wang; Roy Xie; Shang Zhu; Jue Wang; Ben Athiwaratkun; Bhuwan; Dhingra; Shuaiwen Leon Song; Ce Zhang; James Zou

arXiv:2505.03059·cs.CL·May 7, 2025

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS

Junlin Wang, Roy Xie, Shang Zhu, Jue Wang, Ben Athiwaratkun, Bhuwan, Dhingra, Shuaiwen Leon Song, Ce Zhang, James Zou

PDF

Open Access 4 Models 1 Datasets

TL;DR

This paper introduces MoAA, a novel method leveraging collective intelligence of multiple language models to generate high-quality data for aligning open-source LLMs, improving performance and enabling self-improvement without external supervision.

Contribution

We propose MoAA, a scalable approach that combines various models' strengths to enhance data quality for model alignment, surpassing single-model methods and enabling self-improvement.

Findings

01

Significant performance gains on benchmark tasks.

02

Models fine-tuned on MoAA data outperform initial capabilities.

03

Demonstrates self-improvement without external data.

Abstract

Building helpful and harmless large language models (LLMs) requires effective model alignment approach based on human instructions and feedback, which necessitates high-quality human-labeled data. Constructing such datasets is often expensive and hard to scale, and may face potential limitations on diversity and generalization. To address these challenges, we introduce Mixture of Agents Alignment (MoAA), that leverages the collective strengths of various language models to provide high-quality data for model alignment. By employing MoAA, we enhance both supervised fine-tuning and preference optimization, leading to improved performance compared to using a single model alone to generate alignment data (e.g. using GPT-4o alone). Evaluation results show that our approach can improve win rate of LLaMA-3.1-8B-Instruct from 19.5 to 48.3 on Arena-Hard and from 22.33 to 57.23 on AlpacaEval2,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

togethercomputer/MoAA-SFT
dataset· 26 dl
26 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Business Process Modeling and Analysis · Service-Oriented Architecture and Web Services