A-I-RAVEN and I-RAVEN-Mesh: Two New Benchmarks for Abstract Visual Reasoning

Miko{\l}aj Ma{\l}ki\'nski; Jacek Ma\'ndziuk

arXiv:2406.11061·cs.AI·May 19, 2025

A-I-RAVEN and I-RAVEN-Mesh: Two New Benchmarks for Abstract Visual Reasoning

Miko{\l}aj Ma{\l}ki\'nski, Jacek Ma\'ndziuk

PDF

Open Access

TL;DR

This paper introduces two new benchmarks, A-I-RAVEN and I-RAVEN-Mesh, to evaluate the generalization and knowledge transfer capabilities of deep neural networks in abstract visual reasoning using Raven's Progressive Matrices.

Contribution

The paper presents A-I-RAVEN with 10 generalization regimes and I-RAVEN-Mesh with line-based patterns, providing new tools for systematic evaluation of AVR models.

Findings

01

Models show limited generalization to unseen attributes.

02

Models struggle with compositionality and complex configurations.

03

Evaluation reveals specific shortcomings in transfer learning.

Abstract

We study generalization and knowledge reuse capabilities of deep neural networks in the domain of abstract visual reasoning (AVR), employing Raven's Progressive Matrices (RPMs), a recognized benchmark task for assessing AVR abilities. Two knowledge transfer scenarios referring to the I-RAVEN dataset are investigated. Firstly, inspired by generalization assessment capabilities of the PGM dataset and popularity of I-RAVEN, we introduce Attributeless-I-RAVEN (A-I-RAVEN), a benchmark with 10 generalization regimes that allow to systematically test generalization of abstract rules applied to held-out attributes at various levels of complexity (primary and extended regimes). In contrast to PGM, A-I-RAVEN features compositionality, a variety of figure configurations, and does not require substantial computational resources. Secondly, we construct I-RAVEN-Mesh, a dataset that enriches RPMs with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConstraint Satisfaction and Optimization · Rough Sets and Fuzzy Logic · AI-based Problem Solving and Planning

MethodsProbability Guided Maxout · Convolution