ARC Prize 2025: Technical Report
Fran\c{c}ois Chollet, Mike Knoop, Gregory Kamradt, Bryan Landers

TL;DR
The paper discusses the ARC-AGI benchmark series, highlighting recent advancements, the emergence of refinement loops in AI reasoning, and the industry’s adoption of ARC-AGI as a standard for measuring fluid intelligence and abstract reasoning.
Contribution
It surveys top methods, analyzes the role of refinement loops in AGI progress, and introduces future interactive reasoning challenges in ARC-AGI-3.
Findings
ARC-AGI-2 dataset presents increased task complexity.
Refinement loops are emerging as a key component in AI reasoning.
Industry adoption of ARC-AGI as a standard benchmark.
Abstract
The ARC-AGI benchmark series serves as a critical measure of few-shot generalization on novel tasks, a core aspect of intelligence. The ARC Prize 2025 global competition targeted the newly released ARC-AGI-2 dataset, which features greater task complexity compared to its predecessor. The Kaggle competition attracted 1,455 teams and 15,154 entries, with the top score reaching 24% on the ARC-AGI-2 private evaluation set. Paper submissions nearly doubled year-over-year to 90 entries, reflecting the growing research interest in fluid intelligence and abstract reasoning. The defining theme of 2025 is the emergence of the refinement loop -- a per-task iterative program optimization loop guided by a feedback signal. Refinement loops come in a variety of forms, in particular evolutionary program synthesis approaches and application-layer refinements to commercial AI systems. Such refinement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics
