YOLO-BEV: Generating Bird's-Eye View in the Same Way as 2D Object Detection
Chang Liu, Liguo Zhou, Yanliang Huang, Alois Knoll

TL;DR
YOLO-BEV is an efficient framework that uses multiple cameras and a modified YOLO detection system to generate real-time bird's-eye view maps for vehicle perception, enhancing autonomous driving safety and navigation.
Contribution
It introduces a novel multi-camera setup combined with a custom detection head in YOLO to produce real-time bird's-eye view maps for vehicular perception.
Findings
Feasibility demonstrated in real-time perception tasks
Streamlined architecture with minimized parameters
Potential for rapid deployment in autonomous systems
Abstract
Vehicle perception systems strive to achieve comprehensive and rapid visual interpretation of their surroundings for improved safety and navigation. We introduce YOLO-BEV, an efficient framework that harnesses a unique surrounding cameras setup to generate a 2D bird's-eye view of the vehicular environment. By strategically positioning eight cameras, each at a 45-degree interval, our system captures and integrates imagery into a coherent 3x3 grid format, leaving the center blank, providing an enriched spatial representation that facilitates efficient processing. In our approach, we employ YOLO's detection mechanism, favoring its inherent advantages of swift response and compact model structure. Instead of leveraging the conventional YOLO detection head, we augment it with a custom-designed detection head, translating the panoramically captured data into a unified bird's-eye view map of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
