Multi-Perspective Transformers in ARC-AGI-2 Challenge

Caleb Talley; Vedant Tibrewal; Seun Adekunle; Weiwen Dong; Xinyu Wu; Fariha Sheikh

arXiv:2605.01154 · cs.LG · May 5, 2026

TL;DR

This paper applies TinyLM with test-time fine-tuning to ARC-AGI-2 visual puzzles, reaching 96.1% accuracy on the training set but only 21.7% on the evaluation set.

Contribution

The paper introduces a multi-perspective transformer approach that combines Test-Time Training (TTT) with a Product of Experts (PoE) to solve ARC-AGI-2 visual puzzles.

Findings

01. 96.1% accuracy on the training set

02. 21.7% accuracy on the evaluation set

03. Demonstrates the potential of test-time fine-tuning techniques (TTT and PoE) on ARC-AGI-2

Abstract

ARC-AGI-2 is a benchmark of human-intuitive visual puzzles that measures a machine's ability to generalize from limited examples, interpret symbolic meaning, and flexibly apply rules in varying contexts. In this paper, we discuss our approach to solving the ARC-AGI-2 puzzles with TinyLM, using additional test-time techniques, including Test-Time Training (TTT) and Product of Experts (PoE). Our model achieves 96.1% accuracy on the training set and 21.7% accuracy on the evaluation set.
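The abstract names a product of experts but does not describe how it is applied. As a rough illustration of the general idea only, and not the authors' implementation, the sketch below multiplies the probabilities that several "experts" assign to the same candidate answers and then renormalizes. The function name `product_of_experts`, the candidate labels, and the example probabilities are all hypothetical; it is an assumption here that each expert could correspond to one perspective on the same puzzle.

```python
import math


def product_of_experts(expert_probs):
    """Combine per-expert probabilities over a shared set of candidates.

    expert_probs: list of dicts mapping candidate -> probability (must be > 0).
    Returns a dict mapping candidate -> normalized product of probabilities.
    This is a generic sketch, not the method used in the paper.
    """
    # Only candidates scored by every expert can be combined.
    candidates = set(expert_probs[0])
    for probs in expert_probs[1:]:
        candidates &= set(probs)

    # Summing log-probabilities is equivalent to multiplying probabilities,
    # but avoids underflow when many experts are combined.
    log_scores = {
        c: sum(math.log(probs[c]) for probs in expert_probs)
        for c in candidates
    }

    # Normalize with the log-sum-exp trick for numerical stability.
    peak = max(log_scores.values())
    total = sum(math.exp(s - peak) for s in log_scores.values())
    return {c: math.exp(s - peak) / total for c, s in log_scores.items()}


# Hypothetical scores from three experts for two candidate output grids.
experts = [
    {"grid_a": 0.6, "grid_b": 0.4},
    {"grid_a": 0.3, "grid_b": 0.7},
    {"grid_a": 0.2, "grid_b": 0.8},
]
combined = product_of_experts(experts)
best = max(combined, key=combined.get)
print(best, round(combined[best], 4))
```

Because the scores are multiplied, a candidate that any single expert rates as very unlikely is penalized heavily, so the combination favors answers that all experts find plausible. In this toy input the first expert alone prefers `grid_a`, yet the combined score prefers `grid_b`.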
