Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design
Alberto Pepe, Chien-Yu Lin, Despoina Magka, Bilge Acun, Yannan Nellie Wu, Anton Protopopov, Carole-Jean Wu, Yoram Bachrach

TL;DR
This paper presents AIRA-Compose and AIRA-Design, AI agent frameworks that autonomously discover and optimize neural architectures and mechanisms, outperforming hand-designed models and enabling recursive self-improvement.
Contribution
Introduction of dual AI agent frameworks for autonomous neural architecture and mechanism discovery, demonstrating superior performance and efficiency over existing models.
Findings
AIRA-Compose discovers architectures outperforming Llama 3.2 and Composer baselines.
AIRA-Design agents create architectures close to state-of-the-art on benchmarks.
Models found by AIRA-Compose scale faster and achieve better accuracy.
Abstract
Toward recursive self-improvement, we investigate LLM agents autonomously designing foundation models beyond standard Transformers. We introduce a dual-framework approach: AIRA-Compose for high-level architecture search, and AIRA-Design for low-level mechanistic implementation. AIRA-Compose uses 11 agents to explore fundamental computational primitives under a 24-hour budget. Agents evaluate million-parameter candidates, extrapolating top designs to 350M, 1B, and 3B scales. This yields 14 architectures across two families: AIRAformers (Transformer-based) and AIRAhybrids (Transformer-Mamba). Pre-trained at 1B scale, these consistently outperform Llama 3.2 and Composer-found baselines. On downstream tasks, AIRAformer-D and AIRAhybrid-D improve accuracy by 2.4% and 3.8% over Llama 3.2. Furthermore, AIRA-Compose finds models with highly efficient scaling frontiers: AIRAformer-C scales 54%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
