Depth-aware Volume Attention for Texture-less Stereo Matching

Tong Zhao; Mingyu Ding; Wei Zhan; Masayoshi Tomizuka; Yintao Wei

arXiv:2402.08931·cs.CV·February 28, 2024·2 cites

Depth-aware Volume Attention for Texture-less Stereo Matching

Tong Zhao, Mingyu Ding, Wei Zhan, Masayoshi Tomizuka, Yintao Wei

PDF

Open Access 1 Repo

TL;DR

This paper introduces a depth-aware volume attention mechanism to improve stereo matching in texture-less scenarios, achieving state-of-the-art results by emphasizing local structures and hierarchical filtering.

Contribution

It proposes a novel lightweight volume refinement scheme with depth-aware hierarchy and target-aware disparity attention modules for better texture-less stereo matching.

Findings

01

Achieves state-of-the-art performance on public datasets.

02

Excels in texture-less and ambiguous regions.

03

Provides a new depth-wise relative error evaluation metric.

Abstract

Stereo matching plays a crucial role in 3D perception and scenario understanding. Despite the proliferation of promising methods, addressing texture-less and texture-repetitive conditions remains challenging due to the insufficient availability of rich geometric and semantic information. In this paper, we propose a lightweight volume refinement scheme to tackle the texture deterioration in practical outdoor scenarios. Specifically, we introduce a depth volume supervised by the ground-truth depth map, capturing the relative hierarchy of image texture. Subsequently, the disparity discrepancy volume undergoes hierarchical filtering through the incorporation of depth-aware hierarchy attention and target-aware disparity attention modules. Local fine structure and context are emphasized to mitigate ambiguity and redundancy during volume aggregation. Furthermore, we propose a more rigorous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ztsrxh/DVANet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image and Video Stabilization · Advanced Image and Video Retrieval Techniques