A Multi-Resolution Benchmark Framework for Spatial Reasoning Assessment in Neural Networks
Manuela Imbriani, Gina Belmonte, Mieke Massink, Alessandro Tofani, Vincenzo Ciancia

TL;DR
This paper introduces a comprehensive benchmark framework for evaluating neural networks' spatial reasoning abilities, focusing on topology and geometry, revealing significant current limitations and guiding future hybrid approaches.
Contribution
The paper develops a multi-resolution benchmark framework for systematic spatial reasoning assessment in neural networks, integrating synthetic datasets, standardized training, and evaluation protocols.
Findings
Neural networks show significant challenges in basic geometric tasks.
Systematic failures observed in topological and spatial distance reasoning.
Framework enables reproducible evaluation of spatial reasoning capabilities.
Abstract
This paper presents preliminary results in the definition of a comprehensive benchmark framework designed to systematically evaluate spatial reasoning capabilities in neural networks, with a particular focus on morphological properties such as connectivity and distance relationships. The framework is currently being used to study the capabilities of nnU-Net, exploiting the spatial model checker VoxLogicA to generate two distinct categories of synthetic datasets: maze connectivity problems for topological analysis and spatial distance computation tasks for geometric understanding. Each category is evaluated across multiple resolutions to assess scalability and generalization properties. The automated pipeline encompasses a complete machine learning workflow including: synthetic dataset generation, standardized training with cross-validation, inference execution, and comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making
