Diffusion-guided Generalizable Enhancer for Urban Scene Reconstruction

Henry Che; Jingkang Wang; Yun Chen; Ze Yang; Sivabalan Manivasagam; Raquel Urtasun

arXiv:2605.22420·cs.CV·May 22, 2026

Diffusion-guided Generalizable Enhancer for Urban Scene Reconstruction

Henry Che, Jingkang Wang, Yun Chen, Ze Yang, Sivabalan Manivasagam, Raquel Urtasun

PDF

TL;DR

This paper introduces GenRe, a diffusion-guided enhancer that improves urban scene reconstruction, achieving high-quality, generalizable 3D representations efficiently for autonomous driving applications.

Contribution

GenRe is a novel method that distills generative priors into 3D representations, enabling rapid, robust, and generalizable urban scene reconstruction from pretrained models.

Findings

01

GenRe outperforms existing methods in quality and efficiency.

02

It generalizes reliably to unseen viewpoints like lane changes.

03

GenRe benefits downstream tasks such as sensor simulation.

Abstract

Urban scene reconstruction from real-world observations has emerged as a powerful tool for self-driving development and testing. While current neural rendering approaches achieve high-fidelity rendering along the recorded trajectories, their quality degrades significantly under large viewpoint shifts, limiting the applicability for closed-loop simulation. Recent works have shown promising results in using diffusion models to enhance quality at these challenging viewpoints and distill improvements back into 3D representations. However, they often require costly per-scene optimization, and the distilled representations remain fragile and fail to generalize beyond limited synthesized views. To address these limitations, we propose GenRe, a novel diffusion-guided generalizable enhancer for urban scene reconstruction. GenRe takes as input any pretrained 3D Gaussian representation and fixes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.