Language Modeling by Language Models

Junyan Cheng; Peter Clark; Kyle Richardson

arXiv:2506.20249·cs.AI·June 26, 2025

Language Modeling by Language Models

Junyan Cheng, Peter Clark, Kyle Richardson

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Genesys, a multi-agent LLM system that automates the discovery and evaluation of novel language model architectures, significantly improving design success rates and achieving competitive performance on benchmarks.

Contribution

The paper presents a novel multi-agent LLM framework with a genetic programming backbone for efficient autonomous LM architecture discovery, outperforming traditional prompt workflows.

Findings

01

Discovered 1,162 new LM designs, with 1,062 fully verified.

02

Best designs outperform GPT2, Mamba2 on 6 out of 9 benchmarks.

03

86% improvement in successful design generation over prompt-based methods.

Abstract

Can we leverage LLMs to model the process of discovering novel language model (LM) architectures? Inspired by real research, we propose a multi-agent LLM approach that simulates the conventional stages of research, from ideation and literature search (proposal stage) to design implementation (code generation), generative pre-training, and downstream evaluation (verification). Using ideas from scaling laws, our system, Genesys, employs a Ladder of Scales approach; new designs are proposed, adversarially reviewed, implemented, and selectively verified at increasingly larger model scales (14M $\sim$ 350M parameters) with a narrowing budget (the number of models we can train at each scale). To help make discovery efficient and factorizable, Genesys uses a novel genetic programming backbone, which we show has empirical advantages over commonly used direct prompt generation workflows (e.g.,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenai/genesys
jaxOfficial

Videos

Language Modeling by Language Models· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling