ARC-AGI Without Pretraining

Isaac Liao; Albert Gu

arXiv:2512.06104·cs.LG·December 9, 2025

ARC-AGI Without Pretraining

Isaac Liao, Albert Gu

PDF

Open Access

TL;DR

This paper introduces CompressARC, a 76K parameter model that, without pretraining, solves 20% of ARC-AGI puzzles by minimizing description length during inference, demonstrating an alternative approach to achieving generalization in AI.

Contribution

CompressARC is the first deep learning model trained solely on individual puzzles without pretraining, showcasing the potential of MDL for generalization in AI.

Findings

01

Solves 20% of ARC-AGI puzzles without pretraining

02

Demonstrates MDL as an effective inference-time learning method

03

Operates with only 76K parameters, indicating efficiency

Abstract

Conventional wisdom in the age of LLMs dictates that solving IQ-test-like visual puzzles from the ARC-AGI-1 benchmark requires capabilities derived from massive pretraining. To counter this, we introduce CompressARC, a 76K parameter model without any pretraining that solves 20% of evaluation puzzles by minimizing the description length (MDL) of the target puzzle purely during inference time. The MDL endows CompressARC with extreme generalization abilities typically unheard of in deep learning. To our knowledge, CompressARC is the only deep learning method for ARC-AGI where training happens only on a single sample: the target inference puzzle itself, with the final solution information removed. Moreover, CompressARC does not train on the pre-provided ARC-AGI "training set". Under these extremely data-limited conditions, we do not ordinarily expect any puzzles to be solvable at all. Yet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace Recognition and Perception · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications