GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual   Pre-training in Autonomous Driving

Shaoqing Xu; Fang Li; Shengyin Jiang; Ziying Song; Li Liu; Zhi-xin; Yang

arXiv:2411.12452·cs.CV·November 20, 2024

GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving

Shaoqing Xu, Fang Li, Shengyin Jiang, Ziying Song, Li Liu, Zhi-xin, Yang

PDF

Open Access 1 Repo

TL;DR

GaussianPretrain introduces a unified 3D Gaussian representation for holistic scene understanding in autonomous driving, significantly improving pre-training efficiency and perception task performance.

Contribution

It presents a novel pre-training paradigm that integrates geometric and texture scene information using 3D Gaussian anchors, enhancing autonomous driving perception models.

Findings

01

Achieves 40.6% faster pre-training than NeRF-based methods

02

Increases 3D object detection NDS by 7.05%

03

Improves HD map construction mAP by 1.9%

Abstract

Self-supervised learning has made substantial strides in image processing, while visual pre-training for autonomous driving is still in its infancy. Existing methods often focus on learning geometric scene information while neglecting texture or treating both aspects separately, hindering comprehensive scene understanding. In this context, we are excited to introduce GaussianPretrain, a novel pre-training paradigm that achieves a holistic understanding of the scene by uniformly integrating geometric and texture representations. Conceptualizing 3D Gaussian anchors as volumetric LiDAR points, our method learns a deepened understanding of scenes to enhance pre-training performance with detailed spatial structure and texture, achieving that 40.6% faster than NeRF-based method UniPAD with 70% GPU memory only. We demonstrate the effectiveness of GaussianPretrain across multiple 3D perception…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

public-bots/gaussianpretrain
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety

MethodsFocus