FF3R: Feedforward Feature 3D Reconstruction from Unconstrained views

Chaoyi Zhou; Run Wang; Feng Luo; Mert D. Pes\'e; Zhiwen Fan; Yiqi Zhong; Siyu Huang

arXiv:2604.09862·cs.CV·April 14, 2026

FF3R: Feedforward Feature 3D Reconstruction from Unconstrained views

Chaoyi Zhou, Run Wang, Feng Luo, Mert D. Pes\'e, Zhiwen Fan, Yiqi Zhong, Siyu Huang

PDF

TL;DR

FF3R is a novel, fully annotation-free framework that unifies geometric and semantic 3D reasoning from unconstrained multi-view images, improving various vision tasks without requiring camera poses or labels.

Contribution

It introduces a scalable, annotation-free method combining geometric and semantic reasoning with innovative modules for global and local consistency.

Findings

01

Outperforms previous methods in novel-view synthesis

02

Achieves superior open-vocabulary semantic segmentation

03

Demonstrates strong generalization to in-the-wild scenarios

Abstract

Recent advances in vision foundation models have revolutionized geometry reconstruction and semantic understanding. Yet, most of the existing approaches treat these capabilities in isolation, leading to redundant pipelines and compounded errors. This paper introduces FF3R, a fully annotation-free feed-forward framework that unifies geometric and semantic reasoning from unconstrained multi-view image sequences. Unlike previous methods, FF3R does not require camera poses, depth maps, or semantic labels, relying solely on rendering supervision for RGB and feature maps, establishing a scalable paradigm for unified 3D reasoning. In addition, we address two critical challenges in feedforward feature reconstruction pipelines, namely global semantic inconsistency and local structural inconsistency, through two key innovations: (i) a Token-wise Fusion Module that enriches geometry tokens with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.