Towards Training-Free Underwater 3D Object Detection from Sonar Point Clouds: A Comparison of Traditional and Deep Learning Approaches
M. Salman Shaukat, Yannik K\"ackenmeister, Sebastian Bader, Thomas Kirste

TL;DR
This paper compares training-free traditional and deep learning methods for underwater 3D object detection from sonar data, highlighting the robustness of template matching over neural networks trained on synthetic data in real-world conditions.
Contribution
It introduces and evaluates a physics-based synthetic data generation pipeline and a model-based template matching system for training-free underwater 3D detection, providing a large-scale benchmark.
Findings
Neural networks trained on synthetic data achieve 98% mAP in simulation but only 40% on real data.
Template matching maintains 83% mAP on real sonar data without training.
The study demonstrates the robustness of traditional methods over deep learning in data-scarce underwater environments.
Abstract
Underwater 3D object detection remains one of the most challenging frontiers in computer vision, where traditional approaches struggle with the harsh acoustic environment and scarcity of training data. While deep learning has revolutionized terrestrial 3D detection, its application underwater faces a critical bottleneck: obtaining sufficient annotated sonar data is prohibitively expensive and logistically complex, often requiring specialized vessels, expert surveyors, and favorable weather conditions. This work addresses a fundamental question: Can we achieve reliable underwater 3D object detection without real-world training data? We tackle this challenge by developing and comparing two paradigms for training-free detection of artificial structures in multibeam echo-sounder point clouds. Our dual approach combines a physics-based sonar simulation pipeline that generates synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
