Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective

Weijie Wang; Qihang Cao; Sensen Gao; Donny Y. Chen; Haofei Xu; Wenjing Bian; Songyou Peng; Tat-Jen Cham; Chuanxia Zheng; Andreas Geiger; Jianfei Cai; Jia-Wang Bian; Bohan Zhuang

arXiv:2604.14025·cs.CV·April 16, 2026

Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective

Weijie Wang, Qihang Cao, Sensen Gao, Donny Y. Chen, Haofei Xu, Wenjing Bian, Songyou Peng, Tat-Jen Cham, Chuanxia Zheng, Andreas Geiger, Jianfei Cai, Jia-Wang Bian, Bohan Zhuang

PDF

1 Repo

TL;DR

This paper surveys recent feed-forward 3D scene modeling methods, emphasizing model design strategies across diverse geometric representations, and proposes a taxonomy to organize research directions and applications.

Contribution

It introduces a novel, output-agnostic taxonomy of model design strategies for feed-forward 3D reconstruction and provides a comprehensive review of benchmarks, datasets, and applications.

Findings

01

Shared architectural patterns across diverse geometric representations

02

Five key research problems: feature enhancement, geometry awareness, efficiency, augmentation, temporal models

03

Discussion of future challenges like scalability and evaluation standards

Abstract

Reconstructing 3D representations from 2D inputs is a fundamental task in computer vision and graphics, serving as a cornerstone for understanding and interacting with the physical world. While traditional methods achieve high fidelity, they are limited by slow per-scene optimization or category-specific training, which hinders their practical deployment and scalability. Hence, generalizable feed-forward 3D reconstruction has witnessed rapid development in recent years. By learning a model that maps images directly to 3D representations in a single forward pass, these methods enable efficient reconstruction and robust cross-scene generalization. Our survey is motivated by a critical observation: despite the diverse geometric output representations, ranging from implicit fields to explicit primitives, existing feed-forward approaches share similar high-level architectural patterns, such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ziplab/Awesome-Feed-Forward-3D
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.