Self-Supervised Representation Learning with Joint Embedding Predictive Architecture for Automotive LiDAR Object Detection
Haoran Zhu, Zhenyuan Dong, Kristi Topollai, Beiyao Sha, Anna Choromanska

TL;DR
This paper introduces AD-L-JEPA, a novel self-supervised pre-training framework for automotive LiDAR object detection that improves performance and efficiency without relying on contrastive or generative methods.
Contribution
AD-L-JEPA is the first JEPA-based pre-training method for autonomous driving, offering a contrastive- and generative-free approach that enhances detection accuracy and reduces computational resources.
Findings
Consistent improvements on KITTI3D, Waymo, and ONCE datasets.
Reduces GPU hours by up to 2.7x and memory by up to 4x.
Pre-training yields significant mAP gains on large datasets.
Abstract
Recently, self-supervised representation learning relying on vast amounts of unlabeled data has been explored as a pre-training method for autonomous driving. However, directly applying popular contrastive or generative methods to this problem is insufficient and may even lead to negative transfer. In this paper, we present AD-L-JEPA, a novel self-supervised pre-training framework with a joint embedding predictive architecture (JEPA) for automotive LiDAR object detection. Unlike existing methods, AD-L-JEPA is neither generative nor contrastive. Instead of explicitly generating masked regions, our method predicts Bird's-Eye-View embeddings to capture the diverse nature of driving scenes. Furthermore, our approach eliminates the need to manually form contrastive pairs by employing explicit variance regularization to avoid representation collapse. Experimental results demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Autonomous Vehicle Technology and Safety · Remote Sensing and LiDAR Applications
