CADDY Underwater Stereo-Vision Dataset for Human-Robot Interaction (HRI)   in the Context of Diver Activities

Arturo Gomez Chavez; Andrea Ranieri; Davide Chiarella; Enrica Zereik,; Anja Babi\'c; Andreas Birk

arXiv:1807.04856·cs.CV·July 16, 2018

CADDY Underwater Stereo-Vision Dataset for Human-Robot Interaction (HRI) in the Context of Diver Activities

Arturo Gomez Chavez, Andrea Ranieri, Davide Chiarella, Enrica Zereik,, Anja Babi\'c, Andreas Birk

PDF

TL;DR

This paper introduces the CADDY underwater stereo-vision dataset, capturing diver gestures and movements for advancing object detection, segmentation, and human pose estimation in challenging underwater environments.

Contribution

It provides one of the first large-scale underwater datasets with stereo images, IMU data, and annotations for human-robot interaction research.

Findings

01

Dataset enables testing robustness of detection algorithms underwater

02

Stereo images facilitate 3D reasoning in low-texture environments

03

Includes synchronized IMU data for human pose estimation

Abstract

In this article we present a novel underwater dataset collected from several field trials within the EU FP7 project "Cognitive autonomous diving buddy (CADDY)", where an Autonomous Underwater Vehicle (AUV) was used to interact with divers and monitor their activities. To our knowledge, this is one of the first efforts to collect a large dataset in underwater environments targeting object classification, segmentation and human pose estimation tasks. The first part of the dataset contains stereo camera recordings (~10K) of divers performing hand gestures to communicate and interact with an AUV in different environmental conditions. These gestures samples serve to test the robustness of object detection and classification algorithms against underwater image distortions i.e., color attenuation and light backscatter. The second part includes stereo footage (~12.7K) of divers free-swimming in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.