Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph
Ziyang Chen, Yongjun Zhang, Wenting Li, Bingshu Wang, Yong Zhao, C. L., Philip Chen

TL;DR
MoCha-V2 introduces the Motif Correlation Graph to enhance geometric structure preservation in stereo matching, improving interpretability and accuracy, and achieving top performance on the Middlebury benchmark.
Contribution
The paper presents MoCha-V2, a novel stereo matching framework that uses Motif Correlation Graphs for interpretable geometric structure modeling and multi-frequency feature integration.
Findings
Achieved 1st place on Middlebury benchmark.
Effectively captures recurring textures for better geometric reconstruction.
Improves interpretability of stereo matching models.
Abstract
Real-world applications of stereo matching, such as autonomous driving, place stringent demands on both safety and accuracy. However, learning-based stereo matching methods inherently suffer from the loss of geometric structures in certain feature channels, creating a bottleneck in achieving precise detail matching. Additionally, these methods lack interpretability due to the black-box nature of deep learning. In this paper, we propose MoCha-V2, a novel learning-based paradigm for stereo matching. MoCha-V2 introduces the Motif Correlation Graph (MCG) to capture recurring textures, which are referred to as ``motifs" within feature channels. These motifs reconstruct geometric structures and are learned in a more interpretable way. Subsequently, we integrate features from multiple frequency domains through wavelet inverse transformation. The resulting motif features are utilized to restore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
