Depth-aware Volume Attention for Texture-less Stereo Matching
Tong Zhao, Mingyu Ding, Wei Zhan, Masayoshi Tomizuka, Yintao Wei

TL;DR
This paper introduces a depth-aware volume attention mechanism to improve stereo matching in texture-less scenarios, achieving state-of-the-art results by emphasizing local structures and hierarchical filtering.
Contribution
It proposes a novel lightweight volume refinement scheme with depth-aware hierarchy and target-aware disparity attention modules for better texture-less stereo matching.
Findings
Achieves state-of-the-art performance on public datasets.
Excels in texture-less and ambiguous regions.
Provides a new depth-wise relative error evaluation metric.
Abstract
Stereo matching plays a crucial role in 3D perception and scenario understanding. Despite the proliferation of promising methods, addressing texture-less and texture-repetitive conditions remains challenging due to the insufficient availability of rich geometric and semantic information. In this paper, we propose a lightweight volume refinement scheme to tackle the texture deterioration in practical outdoor scenarios. Specifically, we introduce a depth volume supervised by the ground-truth depth map, capturing the relative hierarchy of image texture. Subsequently, the disparity discrepancy volume undergoes hierarchical filtering through the incorporation of depth-aware hierarchy attention and target-aware disparity attention modules. Local fine structure and context are emphasized to mitigate ambiguity and redundancy during volume aggregation. Furthermore, we propose a more rigorous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image and Video Stabilization · Advanced Image and Video Retrieval Techniques
