Street-View Image Generation from a Bird's-Eye View Layout
Alexander Swerdlow, Runsheng Xu, Bolei Zhou

TL;DR
This paper introduces BEVGen, a conditional generative model that creates realistic street-view images from a bird's-eye view layout, enhancing autonomous driving simulation and perception.
Contribution
BEVGen employs a novel cross-view transformation with spatial attention to generate spatially consistent street images from BEV layouts, advancing data-driven simulation methods.
Findings
Accurately renders road and lane lines.
Generates diverse traffic scenes with different weather and lighting.
Achieves high spatial consistency in synthesized images.
Abstract
Bird's-Eye View (BEV) Perception has received increasing attention in recent years as it provides a concise and unified spatial representation across views and benefits a diverse set of downstream driving applications. At the same time, data-driven simulation for autonomous driving has been a focal point of recent research but with few approaches that are both fully data-driven and controllable. Instead of using perception data from real-life scenarios, an ideal model for simulation would generate realistic street-view images that align with a given HD map and traffic layout, a task that is critical for visualizing complex traffic scenarios and developing robust perception models for autonomous driving. In this paper, we propose BEVGen, a conditional generative model that synthesizes a set of realistic and spatially consistent surrounding images that match the BEV layout of a traffic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Remote Sensing and LiDAR Applications · Image Processing and 3D Reconstruction
MethodsALIGN
