Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
Haoming Chen, Zhizhong Zhang, Yanyun Qu, Ruixin Zhang, Xin Tan, Yuan, Xie

TL;DR
This paper introduces a scene-level semantic consistency framework for 3D pre-training, effectively bridging semantic segments across scenes to improve universal perception tasks in large-scale dynamic environments.
Contribution
It proposes the CSC framework that leverages vision foundation models and cross-scene prototypes to unify 3D representations across diverse scenes, enhancing pre-training effectiveness.
Findings
Improved semantic segmentation accuracy (+1.4% mIoU)
Enhanced object detection performance (+1.0% mAP)
Better panoptic segmentation (+3.0% PQ)
Abstract
An effective pre-training framework with universal 3D representations is extremely desired in perceiving large-scale dynamic scenes. However, establishing such an ideal framework that is both task-generic and label-efficient poses a challenge in unifying the representation of the same primitive across diverse scenes. The current contrastive 3D pre-training methods typically follow a frame-level consistency, which focuses on the 2D-3D relationships in each detached image. Such inconsiderate consistency greatly hampers the promising path of reaching an universal pre-training framework: (1) The cross-scene semantic self-conflict, i.e., the intense collision between primitive segments of the same semantics from different scenes; (2) Lacking a globally unified bond that pushes the cross-scene semantic consistency into 3D representation learning. To address above challenges, we propose a CSC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Advanced Neural Network Applications · Manufacturing Process and Optimization
