AnyView: Synthesizing Any Novel View in Dynamic Scenes

Basile Van Hoorick; Dian Chen; Shun Iwase; Pavel Tokmakov; Muhammad Zubair Irshad; Igor Vasiljevic; Swati Gupta; Fangzhou Cheng; Sergey Zakharov; Vitor Campagnolo Guizilini

arXiv:2601.16982·cs.CV·January 26, 2026

AnyView: Synthesizing Any Novel View in Dynamic Scenes

Basile Van Hoorick, Dian Chen, Shun Iwase, Pavel Tokmakov, Muhammad Zubair Irshad, Igor Vasiljevic, Swati Gupta, Fangzhou Cheng, Sergey Zakharov, Vitor Campagnolo Guizilini

PDF

Open Access

TL;DR

AnyView is a diffusion-based framework that synthesizes novel dynamic views in videos from arbitrary camera angles, leveraging diverse datasets to maintain consistency and realism in highly dynamic scenes.

Contribution

It introduces a generalist spatiotemporal implicit representation trained on multiple data sources for zero-shot dynamic view synthesis without geometric assumptions.

Findings

01

AnyView achieves competitive results on standard benchmarks.

02

It maintains high-quality, consistent videos from any viewpoint in extreme dynamic scenarios.

03

Most baselines fail under extreme conditions, while AnyView remains robust.

Abstract

Modern generative video models excel at producing convincing, high-quality outputs, but struggle to maintain multi-view and spatiotemporal consistency in highly dynamic real-world environments. In this work, we introduce \textbf{AnyView}, a diffusion-based video generation framework for \emph{dynamic view synthesis} with minimal inductive biases or geometric assumptions. We leverage multiple data sources with various levels of supervision, including monocular (2D), multi-view static (3D) and multi-view dynamic (4D) datasets, to train a generalist spatiotemporal implicit representation capable of producing zero-shot novel videos from arbitrary camera locations and trajectories. We evaluate AnyView on standard benchmarks, showing competitive results with the current state of the art, and propose \textbf{AnyViewBench}, a challenging new benchmark tailored towards \emph{extreme} dynamic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · 3D Shape Modeling and Analysis