UMI-Underwater: Learning Underwater Manipulation without Underwater Teleoperation

Hao Li; Long Yin Chung; Jack Goler; Ryan Zhang; Xiaochi Xie; Huy Ha; Shuran Song; Mark Cutkosky

arXiv:2603.27012·cs.RO·March 31, 2026

UMI-Underwater: Learning Underwater Manipulation without Underwater Teleoperation

Hao Li, Long Yin Chung, Jack Goler, Ryan Zhang, Xiaochi Xie, Huy Ha, Shuran Song, Mark Cutkosky

PDF

1 Repo

TL;DR

This paper presents a self-supervised system for underwater robotic grasping that transfers knowledge from on-land demonstrations, improving robustness and generalization in underwater environments.

Contribution

It introduces a novel domain transfer method using depth-based affordance representations and a diffusion policy trained on underwater data, enabling zero-shot deployment.

Findings

01

Improved grasping success rate in underwater experiments

02

Enhanced robustness to background and lighting variations

03

Generalization to objects only seen in on-land data

Abstract

Underwater robotic grasping is difficult due to degraded, highly variable imagery and the expense of collecting diverse underwater demonstrations. We introduce a system that (i) autonomously collects successful underwater grasp demonstrations via a self-supervised data collection pipeline and (ii) transfers grasp knowledge from on-land human demonstrations through a depth-based affordance representation that bridges the on-land-to-underwater domain gap and is robust to lighting and color shift. An affordance model trained on on-land handheld demonstrations is deployed underwater zero-shot via geometric alignment, and an affordance-conditioned diffusion policy is then trained on underwater demonstrations to generate control actions. In pool experiments, our approach improves grasping performance and robustness to background shifts, and enables generalization to objects seen only in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://umi-under-water.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.