Language Modeling by Language Models
Junyan Cheng, Peter Clark, Kyle Richardson

TL;DR
This paper introduces Genesys, a multi-agent LLM system that automates the discovery and evaluation of novel language model architectures, significantly improving design success rates and achieving competitive performance on benchmarks.
Contribution
The paper presents a novel multi-agent LLM framework with a genetic programming backbone for efficient autonomous LM architecture discovery, outperforming traditional prompt workflows.
Findings
Discovered 1,162 new LM designs, with 1,062 fully verified.
Best designs outperform GPT2, Mamba2 on 6 out of 9 benchmarks.
86% improvement in successful design generation over prompt-based methods.
Abstract
Can we leverage LLMs to model the process of discovering novel language model (LM) architectures? Inspired by real research, we propose a multi-agent LLM approach that simulates the conventional stages of research, from ideation and literature search (proposal stage) to design implementation (code generation), generative pre-training, and downstream evaluation (verification). Using ideas from scaling laws, our system, Genesys, employs a Ladder of Scales approach; new designs are proposed, adversarially reviewed, implemented, and selectively verified at increasingly larger model scales (14M350M parameters) with a narrowing budget (the number of models we can train at each scale). To help make discovery efficient and factorizable, Genesys uses a novel genetic programming backbone, which we show has empirical advantages over commonly used direct prompt generation workflows (e.g.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
