Hallucinating 360{\deg}: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting

Fei Teng; Kai Luo; Sheng Wu; Siyu Li; Pujun Guo; Jiale Wei; Jiaming Zhang; Kunyu Peng; Kailun Yang

arXiv:2507.06971·cs.CV·February 16, 2026

Hallucinating 360{\deg}: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting

Fei Teng, Kai Luo, Sheng Wu, Siyu Li, Pujun Guo, Jiale Wei, Jiaming Zhang, Kunyu Peng, Kailun Yang

PDF

Open Access

TL;DR

This paper introduces Percep360, a novel panoramic street-view generation method for autonomous driving, utilizing local scene diffusion and probabilistic prompting to produce coherent, controllable, and high-quality panoramic images that improve perception tasks.

Contribution

Percep360 is the first panoramic generation approach that leverages stitched images as supervision, combining diffusion and probabilistic prompting for coherence and controllability.

Findings

01

Generated images outperform stitched images in quality metrics.

02

Enhanced downstream BEV segmentation performance.

03

Effective controllability of panoramic image generation.

Abstract

Panoramic perception holds significant potential for autonomous driving, enabling vehicles to acquire a comprehensive 360{\deg} surround view in a single shot. However, autonomous driving is a data-driven task. Complete panoramic data acquisition requires complex sampling systems and annotation pipelines, which are time-consuming and labor-intensive. Although existing street view generation models have demonstrated strong data regeneration capabilities, they can only learn from the fixed data distribution of existing datasets and cannot leverage stitched pinhole images as a supervisory signal. In this paper, we propose the first panoramic generation method Percep360 for autonomous driving. Percep360 enables coherent generation of panoramic data with control signals based on the stitched panoramic data. Percep360 focuses on two key aspects: coherence and controllability. Specifically, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Visual Attention and Saliency Detection · Robotics and Sensor-Based Localization