PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility
Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem

TL;DR
PatchMatch-RL introduces a novel end-to-end trainable multi-view stereo method that combines pixelwise depth, normal, and visibility estimation with reinforcement learning to improve performance on scenes with large depth ranges and sparse views.
Contribution
It integrates PatchMatch optimization with deep learning and reinforcement learning, enabling pixelwise estimates of depth, normals, and visibility in a trainable framework.
Findings
Outperforms recent learning-based methods on ETH3D benchmark.
Performs comparably to state-of-the-art on Tanks and Temples.
Demonstrates effectiveness in scenes with large depth ranges and sparse views.
Abstract
Recent learning-based multi-view stereo (MVS) methods show excellent performance with dense cameras and small depth ranges. However, non-learning based approaches still outperform for scenes with large depth ranges and sparser wide-baseline views, in part due to their PatchMatch optimization over pixelwise estimates of depth, normals, and visibility. In this paper, we propose an end-to-end trainable PatchMatch-based MVS approach that combines advantages of trainable costs and regularizations with pixelwise estimates. To overcome the challenge of the non-differentiable PatchMatch optimization that involves iterative sampling and hard decisions, we use reinforcement learning to minimize expected photometric cost and maximize likelihood of ground truth depth and normals. We incorporate normal estimation by using dilated patch kernels, and propose a recurrent cost regularization that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques
