PMT-MAE: Dual-Branch Self-Supervised Learning with Distillation for Efficient Point Cloud Classification
Qiang Zheng, Chao Zhang, Jian Sun

TL;DR
PMT-MAE introduces a dual-branch self-supervised learning framework combining Transformer and MLP for efficient and accurate 3D point cloud classification, guided by knowledge distillation from a teacher model.
Contribution
It proposes a novel dual-branch architecture with a fusion mechanism and distillation strategy, achieving state-of-the-art accuracy with high efficiency on point cloud classification.
Findings
Achieves 93.6% accuracy on ModelNet40 without voting.
Surpasses baseline Point-MAE and teacher Point-M2AE in accuracy.
Requires only 40 epochs for pre-training and fine-tuning.
Abstract
Advances in self-supervised learning are essential for enhancing feature extraction and understanding in point cloud processing. This paper introduces PMT-MAE (Point MLP-Transformer Masked Autoencoder), a novel self-supervised learning framework for point cloud classification. PMT-MAE features a dual-branch architecture that integrates Transformer and MLP components to capture rich features. The Transformer branch leverages global self-attention for intricate feature interactions, while the parallel MLP branch processes tokens through shared fully connected layers, offering a complementary feature transformation pathway. A fusion mechanism then combines these features, enhancing the model's capacity to learn comprehensive 3D representations. Guided by the sophisticated teacher model Point-M2AE, PMT-MAE employs a distillation strategy that includes feature distillation during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications · 3D Shape Modeling and Analysis
MethodsByte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Linear Layer · Adam · Dropout · Layer Normalization · Dense Connections · Attention Is All You Need
