TL;DR
This paper presents a self-supervised system for underwater robotic grasping that transfers knowledge from on-land demonstrations, improving robustness and generalization in underwater environments.
Contribution
It introduces a novel domain transfer method using depth-based affordance representations and a diffusion policy trained on underwater data, enabling zero-shot deployment.
Findings
Improved grasping success rate in underwater experiments
Enhanced robustness to background and lighting variations
Generalization to objects only seen in on-land data
Abstract
Underwater robotic grasping is difficult due to degraded, highly variable imagery and the expense of collecting diverse underwater demonstrations. We introduce a system that (i) autonomously collects successful underwater grasp demonstrations via a self-supervised data collection pipeline and (ii) transfers grasp knowledge from on-land human demonstrations through a depth-based affordance representation that bridges the on-land-to-underwater domain gap and is robust to lighting and color shift. An affordance model trained on on-land handheld demonstrations is deployed underwater zero-shot via geometric alignment, and an affordance-conditioned diffusion policy is then trained on underwater demonstrations to generate control actions. In pool experiments, our approach improves grasping performance and robustness to background shifts, and enables generalization to objects seen only in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
