SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection
Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei, Li, Yang Shen

TL;DR
This paper introduces SGV3D, a framework that enhances vision-based roadside 3D object detection by improving scenario generalization through background suppression and semi-supervised data generation, significantly outperforming previous methods on new scenes.
Contribution
The paper proposes a novel Scenario Generalization Framework with background suppression and semi-supervised data generation to improve roadside 3D detection across diverse scenes.
Findings
Significant performance improvements on large-scale benchmarks.
Over 42% gain in vehicle detection on DAIR-V2X-I.
Notable gains in pedestrian and cyclist detection.
Abstract
Roadside perception can greatly increase the safety of autonomous vehicles by extending their perception ability beyond the visual range and addressing blind spots. However, current state-of-the-art vision-based roadside detection methods possess high accuracy on labeled scenes but have inferior performance on new scenes. This is because roadside cameras remain stationary after installation and can only collect data from a single scene, resulting in the algorithm overfitting these roadside backgrounds and camera poses. To address this issue, in this paper, we propose an innovative Scenario Generalization Framework for Vision-based Roadside 3D Object Detection, dubbed SGV3D. Specifically, we employ a Background-suppressed Module (BSM) to mitigate background overfitting in vision-centric pipelines by attenuating background features during the 2D to bird's-eye-view projection. Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Advanced Image and Video Retrieval Techniques
