S$^2$-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation
Quantao Yang, Michael C. Welle, Danica Kragic, and Olov Andersson

TL;DR
This paper introduces S$^2$-Diffusion, a novel robot manipulation method that generalizes skills from specific instances to entire categories using a semantic diffusion approach, enabling transferability and robustness with minimal data.
Contribution
The work presents a new open-vocabulary diffusion policy that generalizes manipulation skills from instance-level to category-level, incorporating semantic prompts and depth estimation for single RGB input.
Findings
Enables transfer of manipulation skills across object instances within the same category.
Achieves robust performance in both simulation and real-world tasks.
Operates effectively with only a single RGB camera and depth estimation.
Abstract
Recent advances in skill learning has propelled robot manipulation to new heights by enabling it to learn complex manipulation tasks from a practical number of demonstrations. However, these skills are often limited to the particular action, object, and environment \textit{instances} that are shown in the training data, and have trouble transferring to other instances of the same category. In this work we present an open-vocabulary Spatial-Semantic Diffusion policy (S-Diffusion) which enables generalization from instance-level training data to category-level, enabling skills to be transferable between instances of the same category. We show that functional aspects of skills can be captured via a promptable semantic module combined with a spatial representation. We further propose leveraging depth estimation networks to allow the use of only a single RGB camera. Our approach is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning
MethodsDiffusion
