DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving
Hongbin Lin, Yiming Yang, Chaoda Zheng, Yifan Zhang, Shuaicheng Niu, Zilu Guo, Yafeng Li, Gui Gui, Shuguang Cui, Zhen Li

TL;DR
DriveFlow is a novel method that enhances training data for 3D object detection in autonomous driving by adapting pre-trained text-to-image flow models, improving robustness against out-of-distribution scenarios.
Contribution
It introduces a rectified flow adaptation technique with frequency-based strategies to improve data augmentation for robust 3D detection without additional training.
Findings
Significant performance improvements across all categories in OOD scenarios.
Effective preservation of 3D object geometry during image editing.
Efficient data enhancement method validated by comprehensive experiments.
Abstract
In autonomous driving, vision-centric 3D object detection recognizes and localizes 3D objects from RGB images. However, due to high annotation costs and diverse outdoor scenes, training data often fails to cover all possible test scenarios, known as the out-of-distribution (OOD) issue. Training-free image editing offers a promising solution for improving model robustness by training data enhancement without any modifications to pre-trained diffusion models. Nevertheless, inversion-based methods often suffer from limited effectiveness and inherent inaccuracies, while recent rectified-flow-based approaches struggle to preserve objects with accurate 3D geometry. In this paper, we propose DriveFlow, a Rectified Flow Adaptation method for training data enhancement in autonomous driving based on pre-trained Text-to-Image flow models. Based on frequency decomposition, DriveFlow introduces two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Image Enhancement Techniques
