DARWIN: Dynamic Agentically Rewriting Self-Improving Network
Henry Jiang

TL;DR
DARWIN is an evolutionary GPT training framework where multiple agents iteratively modify training code, leading to performance improvements demonstrated by increased efficiency and reduced perplexity over several iterations.
Contribution
The paper introduces a novel self-improving GPT training method using genetic algorithms and multi-agent code modification, with integrated HITL and memory tracking.
Findings
Achieved 1.26% improvement in FLOPS utilization
Achieved 2.07% reduction in perplexity
Demonstrated scalability over multiple iterations
Abstract
DARWIN is an evolutionary GPT model, utilizing a genetic-algorithm like optimization structure with several independent GPT agents being trained individually using unique training code. Each iteration, the GPT models are prompted to modify the training code of one another in an attempt to improve their performance in a mutation-like manner, and the best GPT agents are then benchmarked and selected for the next iteration by genetic algorithm. For demonstration purposes and due to budget and time constraints, OpenAI API is used to prompt training code improvements and the nanoGPT framework is used as the training code. DARWIN also utilizes persistent JSON-based memory files to track previous reasoning and changes to code to correlate with improvement to model performance. and a bidirectional interface for HITL intervention allowing the model to request upgrades such as additional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Machine Learning and Algorithms · Scientific Computing and Data Management
