Hierarchical Abstraction Enables Human-Like 3D Object Recognition in Deep Learning Models
Shuhao Fu, Philip J. Kellman, Hongjing Lu

TL;DR
This paper investigates how hierarchical abstraction in deep learning models, especially point transformers, enables human-like recognition of 3D objects from sparse point cloud data, highlighting the importance of global shape representation.
Contribution
It demonstrates that hierarchical abstraction in point transformer models better mimics human 3D shape recognition than convolutional models, emphasizing the role of global shape processing.
Findings
Point transformer models outperform convolutional models in 3D recognition tasks.
Hierarchical abstraction mechanisms improve global shape understanding.
Humans perform consistently across various 3D recognition conditions.
Abstract
Both humans and deep learning models can recognize objects from 3D shapes depicted with sparse visual information, such as a set of points randomly sampled from the surfaces of 3D objects (termed a point cloud). Although deep learning models achieve human-like performance in recognizing objects from 3D shapes, it remains unclear whether these models develop 3D shape representations similar to those used by human vision for object recognition. We hypothesize that training with 3D shapes enables models to form representations of local geometric structures in 3D shapes. However, their representations of global 3D object shapes may be limited. We conducted two human experiments systematically manipulating point density and object orientation (Experiment 1), and local geometric structure (Experiment 2). Humans consistently performed well across all experimental conditions. We compared two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Advanced Neural Network Applications
