StereoAdapter-2: Globally Structure-Consistent Underwater Stereo Depth Estimation
Zeyu Ren, Xiang Li, Yiran Wang, Zeyu Zhang, Hao Tang

TL;DR
StereoAdapter-2 introduces a novel operator for underwater stereo depth estimation that captures long-range spatial structure efficiently, significantly improving zero-shot performance on underwater benchmarks and demonstrating robustness in real-world tests.
Contribution
It proposes a new ConvSS2D operator with a four-directional scanning strategy for better long-range disparity propagation in underwater stereo depth estimation.
Findings
Achieves 17% improvement on TartanAir-UW benchmark.
Attains 7.2% improvement on SQUID benchmark.
Demonstrates robustness in real-world underwater robot tests.
Abstract
Stereo depth estimation is fundamental to underwater robotic perception, yet suffers from severe domain shifts caused by wavelength-dependent light attenuation, scattering, and refraction. Recent approaches leverage monocular foundation models with GRU-based iterative refinement for underwater adaptation; however, the sequential gating and local convolutional kernels in GRUs necessitate multiple iterations for long-range disparity propagation, limiting performance in large-disparity and textureless underwater regions. In this paper, we propose StereoAdapter-2, which replaces the conventional ConvGRU updater with a novel ConvSS2D operator based on selective state space models. The proposed operator employs a four-directional scanning strategy that naturally aligns with epipolar geometry while capturing vertical structural consistency, enabling efficient long-range spatial propagation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
