Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models
Hai Wang, Xiaoyu Xiang, Yuchen Fan, Jing-Hao Xue

TL;DR
This paper introduces a method for customizing 360-degree panoramas using text-to-image diffusion models, addressing global geometry and seamless stitching through a specialized dataset, fine-tuning, and a novel StitchDiffusion technique.
Contribution
It presents a new approach combining dataset curation, model fine-tuning with LoRA, and StitchDiffusion to generate seamless, high-quality 360-degree panoramas from text prompts.
Findings
Effective generation of seamless 360-degree panoramas.
Model generalizes well to unseen scenes.
StitchDiffusion improves continuity between image edges.
Abstract
Personalized text-to-image (T2I) synthesis based on diffusion models has attracted significant attention in recent research. However, existing methods primarily concentrate on customizing subjects or styles, neglecting the exploration of global geometry. In this study, we propose an approach that focuses on the customization of 360-degree panoramas, which inherently possess global geometric properties, using a T2I diffusion model. To achieve this, we curate a paired image-text dataset specifically designed for the task and subsequently employ it to fine-tune a pre-trained T2I diffusion model with LoRA. Nevertheless, the fine-tuned model alone does not ensure the continuity between the leftmost and rightmost sides of the synthesized images, a crucial characteristic of 360-degree panoramas. To address this issue, we propose a method called StitchDiffusion. Specifically, we perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Customizing 360-Degree Panoramas Through Text-to-Image Diffusion Models· youtube
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
MethodsDiffusion
