ShapeY: A Principled Framework for Measuring Shape Recognition Capacity via Nearest-Neighbor Matching
Jong Woo Nam, Amanda S. Rios, Bartlett W. Mel

TL;DR
ShapeY is a benchmarking framework that evaluates the shape recognition capacity of object recognition systems using a nearest-neighbor matching task across multiple viewpoints and appearance changes.
Contribution
It introduces a comprehensive, principled method for assessing shape understanding in vision models, highlighting current challenges in achieving viewpoint and appearance invariance.
Findings
State-of-the-art models struggle with consistent shape recognition across viewpoints.
Models often produce egregious mismatches between objects of different shapes.
ShapeY provides detailed quantitative and qualitative performance metrics.
Abstract
Object recognition (OR) in humans relies heavily on shape cues and the ability to recognize objects across varying 3D viewpoints. Unlike humans, deep networks often rely on non-shape cues such as texture and background, leading to vulnerabilities in generalization and robustness. To address this gap, we introduce ShapeY, a novel and principled benchmarking framework designed to evaluate shape-based recognition capability in OR systems. ShapeY comprises 68,200 grayscale images of 200 3D objects rendered from multiple viewpoints and optionally subjected to non-shape ``appearance'' changes. Using a nearest-neighbor matching task, ShapeY specifically probes the fine-grained structure of an OR system's embedding space by evaluating whether object views are clustered by 3D shape similarity across varying 3D viewpoints and other non-shape changes. ShapeY provides a suite of quantitative and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
