The Single-Multi Evolution Loop for Self-Improving Model Collaboration Systems
Shangbin Feng, Kishan Panaganti, Yulia Tsvetkov, Wenhao Yu

TL;DR
This paper introduces a novel single-multi evolution loop that enables self-improving model collaboration systems, distilling collaborative patterns into a single model to enhance efficiency and collective performance across multiple tasks.
Contribution
It proposes the single-multi evolution loop framework, allowing models to self-improve through iterative collaboration and distillation, outperforming existing evolutionary AI methods.
Findings
Models improve by 8.0% on average after distillation.
Collaboration benefits increase by 14.9% after evolution.
The method reduces computational costs by using a single distilled model.
Abstract
Model collaboration -- systems where multiple language models (LMs) collaborate -- combines the strengths of diverse models with cost in loading multiple LMs. We improve efficiency while preserving the strengths of collaboration by distilling collaborative patterns into a single model, where the model is trained on the outputs of the model collaboration system. At inference time, only the distilled model is employed: it imitates the collaboration while only incurring the cost of a single model. Furthermore, we propose the single-multi evolution loop: multiple LMs collaborate, each distills from the collaborative outputs, and these post-distillation improved LMs collaborate again, forming a collective evolution ecosystem where models evolve and self-improve by interacting with an environment of other models. Extensive experiments with 7 collaboration strategies and 15 tasks (QA,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Model-Driven Software Engineering Techniques · Natural Language Processing Techniques
