Geo-EVS: Geometry-Conditioned Extrapolative View Synthesis for Autonomous Driving

Yatong Lan; Rongkui Tang; Lei He

arXiv:2604.07250 · cs.CV · April 9, 2026

TL;DR

Geo-EVS is a geometry-conditioned framework for extrapolative view synthesis in autonomous driving. It improves synthesis quality and geometric accuracy for sparse-view and out-of-trajectory target poses.

Contribution

Geo-EVS combines Geometry-Aware Reprojection (GAR) with Artifact-Guided Latent Diffusion (AGLD) to improve extrapolative view synthesis under sparse supervision.

Findings

01 Improves sparse-view synthesis quality on the Waymo dataset.

02 Improves geometric accuracy in high-angle and low-coverage settings.

03 Improves downstream 3D detection performance.

Abstract

Extrapolative novel view synthesis can reduce camera-rig dependency in autonomous driving by generating standardized virtual views from heterogeneous sensors. Existing methods degrade outside recorded trajectories because extrapolated poses provide weak geometric support and no dense target-view supervision. The key is to explicitly expose the model to out-of-trajectory condition defects during training. We propose Geo-EVS, a geometry-conditioned framework under sparse supervision. Geo-EVS has two components. Geometry-Aware Reprojection (GAR) uses fine-tuned VGGT to reconstruct colored point clouds and reproject them to observed and virtual target poses, producing geometric condition maps. This design unifies the reprojection path between training and inference. Artifact-Guided Latent Diffusion (AGLD) injects reprojection-derived artifact masks during training so the model learns to…
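
The reprojection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simple pinhole camera, a point cloud already expressed in world coordinates, and a hypothetical helper `reproject_points` that splats each point to its nearest pixel and keeps the closest point per pixel. The returned hole mask (pixels that receive no point) is only a stand-in for a reprojection-derived artifact mask; the excerpt does not say how Geo-EVS defines its masks.

```python
import numpy as np


def reproject_points(points_world, colors, K, T_cam_from_world, height, width):
    """Project a colored point cloud into a pinhole camera (illustrative only).

    points_world:     (N, 3) points in world coordinates.
    colors:           (N, 3) per-point RGB values.
    K:                (3, 3) camera intrinsics.
    T_cam_from_world: (4, 4) rigid transform from world to camera frame.
    Returns (image, hole_mask): an (H, W, 3) float32 image and an (H, W)
    boolean mask that is True where no point landed.
    """
    R = T_cam_from_world[:3, :3]
    t = T_cam_from_world[:3, 3]
    pts_cam = points_world @ R.T + t

    # Keep only points in front of the camera.
    z = pts_cam[:, 2]
    in_front = z > 1e-6
    pts_cam, z, cols = pts_cam[in_front], z[in_front], colors[in_front]

    # Perspective projection to integer pixel coordinates.
    uvw = pts_cam @ K.T
    u = np.round(uvw[:, 0] / uvw[:, 2]).astype(int)
    v = np.round(uvw[:, 1] / uvw[:, 2]).astype(int)

    # Discard points that fall outside the image.
    inside = (u >= 0) & (u < width) & (v >= 0) & (v < height)
    u, v, z, cols = u[inside], v[inside], z[inside], cols[inside]

    # Z-buffer: sort by pixel, then by depth, and keep the nearest point per pixel.
    flat = v * width + u
    order = np.lexsort((z, flat))
    _, first = np.unique(flat[order], return_index=True)
    nearest = order[first]

    image = np.zeros((height, width, 3), dtype=np.float32)
    hole_mask = np.ones((height, width), dtype=bool)
    image[v[nearest], u[nearest]] = cols[nearest]
    hole_mask[v[nearest], u[nearest]] = False
    return image, hole_mask
```

Calling the same function with an observed pose and with a virtual target pose reflects the abstract's point that one reprojection path serves both training and inference. Pixels flagged in `hole_mask` tend to grow as the target pose moves away from the recorded trajectory, which is the kind of condition defect the abstract says the model should see during training.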
