FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent
Cameron Smith, David Charatan, Ayush Tewari, Vincent Sitzmann

TL;DR
FlowMap is a differentiable approach that jointly optimizes camera poses, intrinsics, and dense depth from videos, enabling high-quality novel view synthesis and outperforming traditional methods in accuracy and realism.
Contribution
The paper introduces FlowMap, a novel end-to-end differentiable method for jointly estimating camera parameters and dense depth, surpassing prior gradient-based methods and rivaling state-of-the-art SfM techniques.
Findings
FlowMap achieves photo-realistic novel view synthesis on 360-degree trajectories.
It outperforms prior gradient-descent bundle adjustment methods.
Performs on par with COLMAP on downstream view synthesis tasks.
Abstract
This paper introduces FlowMap, an end-to-end differentiable method that solves for precise camera poses, camera intrinsics, and per-frame dense depth of a video sequence. Our method performs per-video gradient-descent minimization of a simple least-squares objective that compares the optical flow induced by depth, intrinsics, and poses against correspondences obtained via off-the-shelf optical flow and point tracking. Alongside the use of point tracks to encourage long-term geometric consistency, we introduce differentiable re-parameterizations of depth, intrinsics, and pose that are amenable to first-order optimization. We empirically show that camera parameters and dense depth recovered by our method enable photo-realistic novel view synthesis on 360-degree trajectories using Gaussian Splatting. Our method not only far outperforms prior gradient-descent based bundle adjustment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · 3D Surveying and Cultural Heritage
