Procedural Refinement by LLM-driven Algorithmic Debugging for ARC-AGI-2

Yu-Ning Qiu; Lin-Feng Zou; Jiong-Da Wang; Xue-Rong Yuan; Wang-Zhou Dai

arXiv:2603.20334·cs.SE·May 15, 2026

Procedural Refinement by LLM-driven Algorithmic Debugging for ARC-AGI-2

Yu-Ning Qiu, Lin-Feng Zou, Jiong-Da Wang, Xue-Rong Yuan, Wang-Zhou Dai

PDF

TL;DR

This paper introduces ABPR, a neuro-symbolic refinement method combining LLMs with Prolog for semantic rule re-checking, significantly improving abstract reasoning performance on benchmarks like ARC-AGI-2.

Contribution

It presents a novel abduction-based refinement approach that enhances LLM-based rule induction through formal program debugging techniques.

Findings

01

ABPR achieves 56.67% Pass@2 on ARC-AGI-2.

02

With GPT-5.5 xHigh, ABPR reaches 98.33% Pass@2.

03

The framework extends to relational and analogical abstraction tasks.

Abstract

In high-complexity abstract reasoning, a system must infer a latent rule from a few examples or structured observations and apply it to unseen instances. LLMs can express such rules as programs, but ordinary conversation-based refinement is largely outcome-level: it observes that an answer or output is wrong without formally re-checking which abstraction, relation, or transformation justified that outcome. We propose \emph{Abduction-Based Procedural Refinement} (ABPR), a neuro-symbolic refinement approach that couples an LLM with a Prolog meta-interpreter. ABPR treats each candidate program as an executable declarative hypothesis of the latent rule and reifies its SLD goal--subgoal resolution into compact proof-tree-style derivations, following Shapiro's algorithmic program debugging (APD). In this view, refinement is not merely code-level debugging, but semantic re-checking of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.