A Practical Approach to Underwater Depth and Surface Normals Estimation
Alzayat Saleh, Melanie Olsen, Bouchra Senadji, Mostafa Rahimi Azghadi

TL;DR
This paper introduces a hybrid deep learning model combining CNNs and Transformers for underwater depth and surface normal estimation, addressing dataset noise and computational efficiency to enable real-time 3D perception in resource-limited underwater robots.
Contribution
It presents a novel hybrid architecture, the DNESA algorithm for data quality, and a lightweight training pipeline for real-time underwater 3D perception.
Findings
Model reduces parameters by 90%
Training costs decreased by 80%
Enables real-time underwater perception
Abstract
Monocular Depth and Surface Normals Estimation (MDSNE) is crucial for tasks such as 3D reconstruction, autonomous navigation, and underwater exploration. Current methods rely either on discriminative models, which struggle with transparent or reflective surfaces, or generative models, which, while accurate, are computationally expensive. This paper presents a novel deep learning model for MDSNE, specifically tailored for underwater environments, using a hybrid architecture that integrates Convolutional Neural Networks (CNNs) with Transformers, leveraging the strengths of both approaches. Training effective MDSNE models is often hampered by noisy real-world datasets and the limited generalization of synthetic datasets. To address this, we generate pseudo-labeled real data using multiple pre-trained MDSNE models. To ensure the quality of this data, we propose the Depth Normal Evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUnderwater Acoustics Research
MethodsFocus
