PatchContrast: Self-Supervised Pre-training for 3D Object Detection

Oren Shrout; Ori Nizan; Yizhak Ben-Shabat; Ayellet Tal

arXiv:2308.06985·cs.CV·April 15, 2025·1 cites

PatchContrast: Self-Supervised Pre-training for 3D Object Detection

Oren Shrout, Ori Nizan, Yizhak Ben-Shabat, Ayellet Tal

PDF

Open Access

TL;DR

PatchContrast is a self-supervised pre-training framework for 3D object detection that leverages proposal and patch levels of abstraction to improve detection accuracy without requiring labeled data.

Contribution

Introduces a novel self-supervised pre-training method using proposal and patch levels for 3D object detection in point clouds.

Findings

01

Outperforms state-of-the-art models on three datasets.

02

Enhances downstream 3D detection performance.

03

Effective across various backbone architectures.

Abstract

Accurately detecting objects in the environment is a key challenge for autonomous vehicles. However, obtaining annotated data for detection is expensive and time-consuming. We introduce PatchContrast, a novel self-supervised point cloud pre-training framework for 3D object detection. We propose to utilize two levels of abstraction to learn discriminative representation from unlabeled data: proposal-level and patch-level. The proposal-level aims at localizing objects in relation to their surroundings, whereas the patch-level adds information about the internal connections between the object's components, hence distinguishing between different objects based on their individual components. We demonstrate how these levels can be integrated into self-supervised pre-training for various backbones to enhance the downstream 3D detection task. We show that our method outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Remote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage