ViewFormer: View Set Attention for Multi-view 3D Shape Understanding
Hongyu Sun, Yongcai Wang, Peng Wang, Xudong Cai, Deying Li

TL;DR
ViewFormer introduces a novel view set attention mechanism for multi-view 3D shape recognition, achieving state-of-the-art accuracy with a simple model that captures complex view correlations.
Contribution
The paper proposes a new view set perspective and an adaptive attention model for multi-view 3D shape understanding, improving flexibility and performance over existing methods.
Findings
Achieves 98.8% accuracy on ModelNet40, surpassing previous methods.
Attains 98.4% accuracy on RGBD dataset, significantly better than baselines.
Sets new records on SHREC'17 3D shape retrieval benchmark.
Abstract
This paper presents ViewFormer, a simple yet effective model for multi-view 3d shape recognition and retrieval. We systematically investigate the existing methods for aggregating multi-view information and propose a novel ``view set" perspective, which minimizes the relation assumption about the views and releases the representation flexibility. We devise an adaptive attention model to capture pairwise and higher-order correlations of the elements in the view set. The learned multi-view correlations are aggregated into an expressive view set descriptor for recognition and retrieval. Experiments show the proposed method unleashes surprising capabilities across different tasks and datasets. For instance, with only 2 attention blocks and 4.8M learnable parameters, ViewFormer reaches 98.8% recognition accuracy on ModelNet40 for the first time, exceeding previous best method by 1.1% . On the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Pose and Action Recognition · 3D Surveying and Cultural Heritage
