DiffuVolume: Diffusion Model for Volume based Stereo Matching

Dian Zheng; Xiao-Ming Wu; Zuhao Liu; Jingke Meng; Wei-shi Zheng

arXiv:2308.15989·cs.CV·August 31, 2023·6 cites

DiffuVolume: Diffusion Model for Volume based Stereo Matching

Dian Zheng, Xiao-Ming Wu, Zuhao Liu, Jingke Meng, Wei-shi Zheng

PDF

Open Access

TL;DR

DiffuVolume introduces a diffusion model-based approach to refine cost volumes in stereo matching, significantly improving accuracy and inference speed, and achieving top rankings on benchmark datasets.

Contribution

The paper pioneers applying diffusion models as cost volume filters in stereo matching, enhancing performance and efficiency with easy integration into existing networks.

Findings

01

22% EPE improvement over traditional methods

02

240 times faster inference speed

03

Ranked 1st on KITTI 2012 leaderboard

Abstract

Stereo matching is a significant part in many computer vision tasks and driving-based applications. Recently cost volume-based methods have achieved great success benefiting from the rich geometry information in paired images. However, the redundancy of cost volume also interferes with the model training and limits the performance. To construct a more precise cost volume, we pioneeringly apply the diffusion model to stereo matching. Our method, termed DiffuVolume, considers the diffusion model as a cost volume filter, which will recurrently remove the redundant information from the cost volume. Two main designs make our method not trivial. Firstly, to make the diffusion model more adaptive to stereo matching, we eschew the traditional manner of directly adding noise into the image but embed the diffusion model into a task-specific module. In this way, we outperform the traditional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Advanced Image Processing Techniques

MethodsDiffusion