Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on   Arithmetic Relations in Abstract Reasoning

Michael Hersche; Giacomo Camposampiero; Roger Wattenhofer; Abu; Sebastian; Abbas Rahimi

arXiv:2412.05586·cs.AI·December 10, 2024

Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract Reasoning

Michael Hersche, Giacomo Camposampiero, Roger Wattenhofer, Abu, Sebastian, Abbas Rahimi

PDF

Open Access 2 Repos

TL;DR

This paper compares large language models and neuro-symbolic approaches in solving abstract reasoning tasks, revealing LLMs' weaknesses in arithmetic reasoning and demonstrating the effectiveness of neuro-symbolic methods like ARLC in maintaining high accuracy.

Contribution

It introduces a neuro-symbolic approach, ARLC, that outperforms LLMs in arithmetic reasoning within abstract visual tasks, especially in extended and challenging scenarios.

Findings

01

LLMs struggle with arithmetic rules in abstract reasoning.

02

ARLC achieves near-perfect accuracy on challenging datasets.

03

LLMs' performance drops significantly with increased attribute range.

Abstract

This work compares large language models (LLMs) and neuro-symbolic approaches in solving Raven's progressive matrices (RPM), a visual abstract reasoning test that involves the understanding of mathematical rules such as progression or arithmetic addition. Providing the visual attributes directly as textual prompts, which assumes an oracle visual perception module, allows us to measure the model's abstract reasoning capability in isolation. Despite providing such compositionally structured representations from the oracle visual perception and advanced prompting techniques, both GPT-4 and Llama-3 70B cannot achieve perfect accuracy on the center constellation of the I-RAVEN dataset. Our analysis reveals that the root cause lies in the LLM's weakness in understanding and executing arithmetic rules. As a potential remedy, we analyze the Abductive Rule Learner with Context-awareness (ARLC),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · AI-based Problem Solving and Planning

MethodsAttention Is All You Need · Adam · Dropout · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Label Smoothing