ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC)
Kartik Singhal, Gautam Shroff

TL;DR
ConceptSearch is a novel algorithm that uses LLMs and concept-based scoring to improve program search efficiency and accuracy on the challenging ARC benchmark, demonstrating significant performance gains.
Contribution
We introduce ConceptSearch, a new function-search algorithm combining LLMs with concept-based scoring to enhance program synthesis for complex generalization tasks.
Findings
Achieved significant performance improvement over GPT-4 prompting.
Concept-based scoring is up to 30% more efficient than Hamming distance.
Demonstrated effectiveness on the ARC challenge.
Abstract
The Abstraction and Reasoning Corpus (ARC) poses a significant challenge to artificial intelligence, demanding broad generalization and few-shot learning capabilities that remain elusive for current deep learning methods, including large language models (LLMs). While LLMs excel in program synthesis, their direct application to ARC yields limited success. To address this, we introduce ConceptSearch, a novel function-search algorithm that leverages LLMs for program generation and employs a concept-based scoring method to guide the search efficiently. Unlike simplistic pixel-based metrics like Hamming distance, ConceptSearch evaluates programs on their ability to capture the underlying transformation concept reflected in the input-output examples. We explore three scoring functions: Hamming distance, a CNN-based scoring function, and an LLM-based natural language scoring function.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSemantic Web and Ontologies · Service-Oriented Architecture and Web Services · Advanced Database Systems and Queries
MethodsAttention Is All You Need · Adam · Dropout · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Label Smoothing
