TL;DR
This paper introduces a multi-scale architecture for abstract relational reasoning tasks, combining different resolutions to improve pattern detection and outperform existing methods on benchmarks.
Contribution
It proposes a novel multi-resolution approach that specializes in different relational rules and introduces a new dataset variant, RAVEN-FAIR, to address dataset biases.
Findings
Outperforms state-of-the-art on all benchmarks by 5-54%
Different resolutions excel at different types of relations
Proposed dataset modification reduces bias and improves evaluation
Abstract
We consider the abstract relational reasoning task, which is commonly used as an intelligence test. Since some patterns have spatial rationales, while others are only semantic, we propose a multi-scale architecture that processes each query in multiple resolutions. We show that indeed different rules are solved by different resolutions and a combined multi-scale approach outperforms the existing state of the art in this task on all benchmarks by 5-54%. The success of our method is shown to arise from multiple novelties. First, it searches for relational patterns in multiple resolutions, which allows it to readily detect visual relations, such as location, in higher resolution, while allowing the lower resolution module to focus on semantic relations, such as shape type. Second, we optimize the reasoning network of each resolution proportionally to its performance, hereby we motivate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
