PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition
Qijian Zhang, Junhui Hou, Yue Qian

TL;DR
This paper introduces PointMCD, a multi-view cross-modal distillation framework that enhances 3D point cloud encoders by transferring knowledge from deep 2D image encoders, significantly improving 3D shape recognition tasks.
Contribution
It proposes a novel distillation architecture with visibility-aware feature projection to align 2D and 3D features, boosting point cloud encoder performance without complex network modifications.
Findings
Improved 3D shape classification accuracy.
Enhanced part segmentation performance.
Effective unsupervised learning results.
Abstract
As two fundamental representation modalities of 3D objects, 3D point clouds and multi-view 2D images record shape information from different domains of geometric structures and visual appearances. In the current deep learning era, remarkable progress in processing such two data modalities has been achieved through respectively customizing compatible 3D and 2D network architectures. However, unlike multi-view image-based 2D visual modeling paradigms, which have shown leading performance in several common 3D shape recognition benchmarks, point cloud-based 3D geometric modeling paradigms are still highly limited by insufficient learning capacity, due to the difficulty of extracting discriminative features from irregular geometric signals. In this paper, we explore the possibility of boosting deep 3D point cloud encoders by transferring visual knowledge extracted from deep 2D image encoders…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Optical measurement and interference techniques
