Product of Experts with LLMs: Boosting Performance on ARC Is a Matter of Perspective

Daniel Franzen; Jan Disselhoff; David Hartmann

arXiv:2505.07859·cs.CL·June 12, 2025

Product of Experts with LLMs: Boosting Performance on ARC Is a Matter of Perspective

Daniel Franzen, Jan Disselhoff, David Hartmann

PDF

Open Access 2 Models

TL;DR

This paper introduces a novel product of experts approach using large language models to improve abstract reasoning on the ARC-AGI benchmark, combining data augmentation, diverse candidate generation, and LLM-based scoring.

Contribution

It presents a transparent, reproducible method that leverages task-specific data augmentation and LLM scoring to achieve state-of-the-art performance on ARC-AGI.

Findings

01

Achieved 71.6% solved tasks on ARC-AGI

02

Low inference cost of around 2 cents per task

03

Outperforms many existing approaches in transparency and efficiency

Abstract

The Abstraction and Reasoning Corpus (ARC-AGI) poses a significant challenge for large language models (LLMs), exposing limitations in their abstract reasoning abilities. In this work, we leverage task-specific data augmentations throughout the training, generation, and scoring phases, and employ a depth-first search algorithm to generate diverse, high-probability candidate solutions. Furthermore, we utilize the LLM not only as a generator but also as a scorer, using its output probabilities to select the most promising solutions. Our method achieves a score of 71.6% (286.5/400 solved tasks) on the public ARC-AGI evaluation set, demonstrating state-of-the-art performance among publicly available approaches. While concurrent closed-source work has reported higher scores, our method distinguishes itself through its transparency, reproducibility, and remarkably low inference cost,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications