ShapeY: Measuring Shape Recognition Capacity Using Nearest Neighbor Matching
Jong Woo Nam, Amanda S. Rios, Bartlett W. Mel

TL;DR
ShapeY is a new benchmark for assessing shape recognition in vision systems using nearest neighbor view matching, revealing current models' difficulties with viewpoint and appearance variations.
Contribution
We introduce ShapeY, a benchmark for measuring shape recognition capacity through view matching, enabling precise difficulty control and revealing limitations of existing models.
Findings
ResNet50 shows high error rates with viewpoint changes
Appearance variations significantly disrupt matching accuracy
Embedding space of ResNet50 is highly tangled and complex
Abstract
Object recognition in humans depends primarily on shape cues. We have developed a new approach to measuring the shape recognition performance of a vision system based on nearest neighbor view matching within the system's embedding space. Our performance benchmark, ShapeY, allows for precise control of task difficulty, by enforcing that view matching span a specified degree of 3D viewpoint change and/or appearance change. As a first test case we measured the performance of ResNet50 pre-trained on ImageNet. Matching error rates were high. For example, a 27 degree change in object pitch led ResNet50 to match the incorrect object 45% of the time. Appearance changes were also highly disruptive. Examination of false matches indicates that ResNet50's embedding space is severely "tangled". These findings suggest ShapeY can be a useful tool for charting the progress of artificial vision systems…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
