Approaching human 3D shape perception with neurally mappable models
Thomas P. O'Connell, Tyler Bonnen, Yoni Friedman, Ayush Tewari, Josh B. Tenenbaum, Vincent Sitzmann, Nancy Kanwisher

TL;DR
This paper investigates how 3D neural field models, trained with multi-view learning, can better replicate human 3D shape perception and matching across viewpoints, highlighting the importance of training methods for human-like performance.
Contribution
It demonstrates that 3D neural field models trained with multi-view objectives align more closely with human shape perception, advancing understanding of computational mechanisms behind 3D shape inference.
Findings
3D-LFN models support human-like 3D shape matching
Multi-view training improves model-human alignment
Models partially generalize to new categories but still differ from humans
Abstract
Humans effortlessly infer the 3D shape of objects. What computations underlie this ability? Although various computational models have been proposed, none of them capture the human ability to match object shape across viewpoints. Here, we ask whether and how this gap might be closed. We begin with a relatively novel class of computational models, 3D neural fields, which encapsulate the basic principles of classic analysis-by-synthesis in a deep neural network (DNN). First, we find that a 3D Light Field Network (3D-LFN) supports 3D matching judgments well aligned to humans for within-category comparisons, adversarially-defined comparisons that accentuate the 3D failure cases of standard DNN models, and adversarially-defined comparisons for algorithmically generated shapes with no category structure. We then investigate the source of the 3D-LFN's ability to achieve human-aligned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Visual perception and processing mechanisms · Advanced Vision and Imaging
MethodsNone
