DP-SfM: Dual-Pixel Structure-from-Motion without Scale Ambiguity

Lilika Makabe; Kohei Ashida; Hiroaki Santo; Fumio Okura; and Yasuyuki Matsushita

arXiv:2605.01852·cs.CV·May 18, 2026

DP-SfM: Dual-Pixel Structure-from-Motion without Scale Ambiguity

Lilika Makabe, Kohei Ashida, Hiroaki Santo, Fumio Okura, and Yasuyuki Matsushita

PDF

1 Repo

TL;DR

This paper introduces a novel method leveraging dual-pixel sensor data to automatically resolve scale ambiguity in multi-view 3D reconstruction without needing reference objects or prior calibration.

Contribution

It presents a simple linear approach utilizing defocus blur in dual-pixel images to determine absolute scale in structure-from-motion, enhancing reconstruction accuracy.

Findings

01

Effectively resolves scale ambiguity without reference objects.

02

Works across diverse scenes and camera setups.

03

Provides publicly available code and data.

Abstract

Multi-view 3D reconstruction, namely, structure-from-motion followed by multi-view stereo, is a fundamental component of 3D computer vision. In general, multi-view 3D reconstruction suffers from an unknown scale ambiguity unless a reference object of known size is present in the scene. In this article, we show that multi-view images captured using a dual-pixel (DP) sensor can automatically resolve the scale ambiguity, without requiring a reference object or prior calibration. Specifically, the defocus blur observed in DP images provides sufficient information to determine the absolute scale when paired with depth maps (up to scale) recovered from multi-view 3D reconstruction. Based on this observation, we develop a simple yet effective linear method to estimate the absolute scale, followed by the intensity-based optimization stage that aligns the left and right DP images by shifting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lilika-makabe/dp-sfm-tpami.git
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.