ARC-AGI Without Pretraining
Isaac Liao, Albert Gu

TL;DR
This paper introduces CompressARC, a 76K parameter model that, without pretraining, solves 20% of ARC-AGI puzzles by minimizing description length during inference, demonstrating an alternative approach to achieving generalization in AI.
Contribution
CompressARC is the first deep learning model trained solely on individual puzzles without pretraining, showcasing the potential of MDL for generalization in AI.
Findings
Solves 20% of ARC-AGI puzzles without pretraining
Demonstrates MDL as an effective inference-time learning method
Operates with only 76K parameters, indicating efficiency
Abstract
Conventional wisdom in the age of LLMs dictates that solving IQ-test-like visual puzzles from the ARC-AGI-1 benchmark requires capabilities derived from massive pretraining. To counter this, we introduce CompressARC, a 76K parameter model without any pretraining that solves 20% of evaluation puzzles by minimizing the description length (MDL) of the target puzzle purely during inference time. The MDL endows CompressARC with extreme generalization abilities typically unheard of in deep learning. To our knowledge, CompressARC is the only deep learning method for ARC-AGI where training happens only on a single sample: the target inference puzzle itself, with the final solution information removed. Moreover, CompressARC does not train on the pre-provided ARC-AGI "training set". Under these extremely data-limited conditions, we do not ordinarily expect any puzzles to be solvable at all. Yet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace Recognition and Perception · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
