When Epipolar Constraint Meets Non-local Operators in Multi-View Stereo
Tianqi Liu, Xinyi Ye, Weiyue Zhao, Zhiyu Pan, Min Shi, Zhiguo Cao

TL;DR
This paper introduces an epipolar line-constrained non-local feature augmentation method for multi-view stereo, significantly improving efficiency and accuracy by leveraging epipolar geometry to limit the search space.
Contribution
The paper proposes a novel epipolar line-based non-local augmentation strategy and an Epipolar Transformer, enhancing multi-view stereo performance while reducing computational overhead.
Findings
Achieves state-of-the-art results on DTU and Tanks-and-Temples benchmarks.
Reduces computational complexity by constraining non-local operations to epipolar lines.
Demonstrates improved reconstruction accuracy with efficient feature aggregation.
Abstract
Learning-based multi-view stereo (MVS) method heavily relies on feature matching, which requires distinctive and descriptive representations. An effective solution is to apply non-local feature aggregation, e.g., Transformer. Albeit useful, these techniques introduce heavy computation overheads for MVS. Each pixel densely attends to the whole image. In contrast, we propose to constrain non-local feature augmentation within a pair of lines: each point only attends the corresponding pair of epipolar lines. Our idea takes inspiration from the classic epipolar geometry, which shows that one point with different depth hypotheses will be projected to the epipolar line on the other view. This constraint reduces the 2D search space into the epipolar line in stereo matching. Similarly, this suggests that the matching of MVS is to distinguish a series of points lying on the same line. Inspired by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
When Epipolar Constraint Meets Non-Local Operators in Multi-View Stereo· youtube
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Dropout · Byte Pair Encoding · Label Smoothing · Absolute Position Encodings · Adam · Softmax
