Fully Self-Supervised Depth Estimation from Defocus Clue
Haozhe Si, Bin Zhao, Dong Wang, Yunpeng Gao, Mulin Chen, Zhigang Wang,, Xuelong Li

TL;DR
This paper introduces a fully self-supervised depth estimation method from sparse focal stacks, eliminating the need for ground-truth depth or all-in-focus images, thus enabling practical real-world applications.
Contribution
It proposes a novel self-supervised framework for depth-from-defocus that works without ground-truth data and uses an optical model for validation and refinement.
Findings
Outperforms existing methods on benchmark datasets
Works effectively with real focal stacks
Provides a strong baseline for self-supervised DFD
Abstract
Depth-from-defocus (DFD), modeling the relationship between depth and defocus pattern in images, has demonstrated promising performance in depth estimation. Recently, several self-supervised works try to overcome the difficulties in acquiring accurate depth ground-truth. However, they depend on the all-in-focus (AIF) images, which cannot be captured in real-world scenarios. Such limitation discourages the applications of DFD methods. To tackle this issue, we propose a completely self-supervised framework that estimates depth purely from a sparse focal stack. We show that our framework circumvents the needs for the depth and AIF image ground-truth, and receives superior predictions, thus closing the gap between the theoretical success of DFD works and their applications in the real world. In particular, we propose (i) a more realistic setting for DFD tasks, where no depth or AIF image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Cell Image Analysis Techniques · Optical Coherence Tomography Applications
