PointContrast: Unsupervised Pre-training for 3D Point Cloud   Understanding

Saining Xie; Jiatao Gu; Demi Guo; Charles R. Qi; Leonidas J. Guibas,; Or Litany

arXiv:2007.10985·cs.CV·November 24, 2020·95 cites

PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding

Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas J. Guibas,, Or Litany

PDF

Open Access 2 Repos

TL;DR

This paper demonstrates that unsupervised pre-training significantly improves 3D point cloud understanding tasks across multiple benchmarks, comparable to supervised methods, and highlights the potential of scaling data collection over annotation.

Contribution

It introduces a unified contrastive pre-training approach for 3D point clouds that enhances performance on high-level scene understanding tasks across diverse datasets.

Findings

01

Unsupervised pre-training improves segmentation and detection results.

02

Pre-training generalizes well across indoor and outdoor, real and synthetic datasets.

03

Scaling data collection may be more beneficial than detailed annotation.

Abstract

Arguably one of the top success stories of deep learning is transfer learning. The finding that pre-training a network on a rich source set (eg., ImageNet) can help boost performance once fine-tuned on a usually much smaller target set, has been instrumental to many applications in language and vision. Yet, very little is known about its usefulness in 3D point cloud understanding. We see this as an opportunity considering the effort required for annotating data in 3D. In this work, we aim at facilitating research on 3D representation learning. Different from previous works, we focus on high-level scene understanding tasks. To this end, we select a suite of diverse datasets and tasks to measure the effect of unsupervised pre-training on a large source set of 3D scenes. Our findings are extremely encouraging: using a unified triplet of architecture, source dataset, and contrastive loss…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning