TL;DR
This paper introduces ASI-Evolve, a framework where AI autonomously advances AI research by discovering new architectures, data curation methods, and algorithms, demonstrating significant improvements across multiple AI development stages.
Contribution
ASI-Evolve is the first unified framework enabling AI-driven discovery in data, architectures, and algorithms, significantly outperforming previous methods in each area.
Findings
Discovered 105 state-of-the-art linear attention architectures, surpassing DeltaNet.
Improved benchmark performance by an average of +3.96 points through evolved data pipelines.
Discovered algorithms outperform existing methods like GRPO by up to +12.5 points.
Abstract
Can AI accelerate the development of AI itself? While recent agentic systems have shown strong performance on well-scoped tasks with rapid feedback, it remains unclear whether they can tackle the costly, long-horizon, and weakly supervised research loops that drive real AI progress. We present ASI-Evolve, an agentic framework for AI-for-AI research that closes this loop through a learn-design-experiment-analyze cycle. ASI-Evolve augments standard evolutionary agents with two key components: a cognition base that injects accumulated human priors into each round of exploration, and a dedicated analyzer that distills complex experimental outcomes into reusable insights for future iterations. To our knowledge, ASI-Evolve is the first unified framework to demonstrate AI-driven discovery across three central components of AI development: data, architectures, and learning algorithms. In neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
