Geometry Reinforced Efficient Attention Tuning Equipped with Normals for Robust Stereo Matching

Jiahao Li; Xinhong Chen; Zhengmin Jiang; Cheng Huang; Yung-Hui Li; Jianping Wang

arXiv:2604.09142·cs.CV·April 13, 2026

Geometry Reinforced Efficient Attention Tuning Equipped with Normals for Robust Stereo Matching

Jiahao Li, Xinhong Chen, Zhengmin Jiang, Cheng Huang, Yung-Hui Li, Jianping Wang

PDF

TL;DR

GREATEN is a novel stereo matching framework that leverages surface normals and sparse attention to improve cross-domain generalization and efficiency, especially in challenging non-Lambertian regions.

Contribution

It introduces a geometry-aware, normal-guided fusion approach with augmentation and sparse attention to enhance synthetic-to-real stereo matching performance.

Findings

01

Reduces errors by 30% on ETH3D

02

Achieves 8.5% improvement on non-Lambertian Booster

03

Runs 19.2% faster than previous methods

Abstract

Despite remarkable advances in image-driven stereo matching over the past decade, Synthetic-to-Realistic Zero-Shot (Syn-to-Real) generalization remains an open challenge. This suboptimal generalization performance mainly stems from cross-domain shifts and ill-posed ambiguities inherent in image textures, particularly in occluded, textureless, repetitive, and non-Lambertian (specular/transparent) regions. To improve Syn-to-Real generalization, we propose GREATEN, a framework that incorporates surface normals as domain-invariant, object-intrinsic, and discriminative geometric cues to compensate for the limitations of image textures. The proposed framework consists of three key components. First, a Gated Contextual-Geometric Fusion (GCGF) module adaptively suppresses unreliable contextual cues in image features and fuses the filtered image features with normal-driven geometric features to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.