CADDY Underwater Stereo-Vision Dataset for Human-Robot Interaction (HRI) in the Context of Diver Activities
Arturo Gomez Chavez, Andrea Ranieri, Davide Chiarella, Enrica Zereik,, Anja Babi\'c, Andreas Birk

TL;DR
This paper introduces the CADDY underwater stereo-vision dataset, capturing diver gestures and movements for advancing object detection, segmentation, and human pose estimation in challenging underwater environments.
Contribution
It provides one of the first large-scale underwater datasets with stereo images, IMU data, and annotations for human-robot interaction research.
Findings
Dataset enables testing robustness of detection algorithms underwater
Stereo images facilitate 3D reasoning in low-texture environments
Includes synchronized IMU data for human pose estimation
Abstract
In this article we present a novel underwater dataset collected from several field trials within the EU FP7 project "Cognitive autonomous diving buddy (CADDY)", where an Autonomous Underwater Vehicle (AUV) was used to interact with divers and monitor their activities. To our knowledge, this is one of the first efforts to collect a large dataset in underwater environments targeting object classification, segmentation and human pose estimation tasks. The first part of the dataset contains stereo camera recordings (~10K) of divers performing hand gestures to communicate and interact with an AUV in different environmental conditions. These gestures samples serve to test the robustness of object detection and classification algorithms against underwater image distortions i.e., color attenuation and light backscatter. The second part includes stereo footage (~12.7K) of divers free-swimming in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
