Uni-Skill: Building Self-Evolving Skill Repository for Generalizable Robotic Manipulation
Senwei Xie, Yuntian Zhang, Ruiping Wang, Xilin Chen

TL;DR
Uni-Skill introduces an adaptive, self-evolving skill repository for robotic manipulation, enabling flexible planning and zero-shot generalization through automatic skill annotation and hierarchical taxonomy.
Contribution
It presents Uni-Skill, a framework that automatically evolves and retrieves skills from a large-scale repository, improving adaptability and generalization in robotic manipulation tasks.
Findings
Achieves state-of-the-art zero-shot generalization in manipulation tasks.
Demonstrates effective skill inference without deployment-time demonstrations.
Outperforms existing VLM-based approaches in simulation and real-world tests.
Abstract
While skill-centric approaches leverage foundation models to enhance generalization in compositional tasks, they often rely on fixed skill libraries, limiting adaptability to new tasks without manual intervention. To address this, we propose Uni-Skill, a Unified Skill-centric framework that supports skill-aware planning and facilitates automatic skill evolution. Unlike prior methods that restrict planning to predefined skills, Uni-Skill requests for new skill implementations when existing ones are insufficient, ensuring adaptable planning with self-augmented skill library. To support automatic implementation of diverse skills requested by the planning module, we construct SkillFolder, a VerbNet-inspired repository derived from large-scale unstructured robotic videos. SkillFolder introduces a hierarchical skill taxonomy that captures diverse skill descriptions at multiple levels of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Multimodal Machine Learning Applications
