TL;DR
The paper introduces the Darwin G"odel Machine, a self-improving AI system inspired by evolution, which iteratively modifies and validates its own code, leading to significant improvements in coding benchmarks and demonstrating open-ended, safe self-improvement.
Contribution
It presents the Darwin G"odel Machine, an open-ended, evolution-inspired framework for self-improving AI that empirically demonstrates continuous performance enhancements.
Findings
Improved performance on SWE-bench from 20.0% to 50.0%.
Enhanced results on Polyglot from 14.2% to 30.7%.
Outperforms baselines without self-improvement or open-ended exploration.
Abstract
Today's AI systems have human-designed, fixed architectures and cannot autonomously and continuously improve themselves. The advance of AI could itself be automated. If done safely, that would accelerate AI development and allow us to reap its benefits much sooner. Meta-learning can automate the discovery of novel algorithms, but is limited by first-order improvements and the human design of a suitable search space. The G\"odel machine proposed a theoretical alternative: a self-improving AI that repeatedly modifies itself in a provably beneficial manner. Unfortunately, proving that most changes are net beneficial is impossible in practice. We introduce the Darwin G\"odel Machine (DGM), a self-improving system that iteratively modifies its own code (thereby also improving its ability to modify its own codebase) and empirically validates each change using coding benchmarks. Inspired by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
