TL;DR
The paper introduces AC/DC, a coevolutionary framework for discovering diverse and novel LLM capabilities by evolving models and tasks simultaneously, surpassing larger models without explicit benchmark optimization.
Contribution
It extends coevolution to LLM discovery, enabling continual innovation in model capabilities and task diversity within a single training run.
Findings
AC/DC discovers LLMs with broader expertise coverage.
Models surpass larger LLMs while using less GPU memory.
Coverage of capabilities improves over time without explicit benchmark tuning.
Abstract
Frontier model developers aim to train models continually to possess emergent, diverse capabilities. To extend capabilities, the current pre-training and post-training paradigm requires manually starting training runs with static datasets or reward functions every time. Addressing this limitation, our work pursues the insight that open-endedness (via the coevolution of models and tasks) can discover models with increasingly novel skills in a single run. We introduce a new model development framework that extends coevolution to large language model (LLM) discovery, open-ended \textit{Assessment Coevolving with Diverse Capabilities} (AC/DC). AC/DC evolves both LLMs via model merging and natural language tasks via synthetic data generation. AC/DC discovers growing archives of LLMs that surpass the capabilities of larger LLMs while taking up less GPU memory. In particular, our LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
