Dominant Set Clustering and Pooling for Multi-View 3D Object Recognition
Chu Wang, Marcello Pelillo, Kaleem Siddiqi

TL;DR
This paper introduces a dominant set clustering and pooling layer for multi-view 3D object recognition, significantly improving accuracy by aggregating similar views and enhancing CNN performance.
Contribution
It proposes a novel recurrent clustering and pooling module based on dominant sets, boosting multi-view 3D recognition accuracy and training efficiency.
Findings
Achieved 93.8% accuracy on ModelNet 40, setting a new state of the art.
Fast approximate learning reduces training time with minimal accuracy loss.
The method effectively pools similar views to improve recognition performance.
Abstract
View based strategies for 3D object recognition have proven to be very successful. The state-of-the-art methods now achieve over 90% correct category level recognition performance on appearance images. We improve upon these methods by introducing a view clustering and pooling layer based on dominant sets. The key idea is to pool information from views which are similar and thus belong to the same cluster. The pooled feature vectors are then fed as inputs to the same layer, in a recurrent fashion. This recurrent clustering and pooling module, when inserted in an off-the-shelf pretrained CNN, boosts performance for multi-view 3D object recognition, achieving a new state of the art test set recognition accuracy of 93.8% on the ModelNet 40 database. We also explore a fast approximate learning strategy for our cluster-pooling CNN, which, while sacrificing end-to-end learning, greatly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Face recognition and analysis
