Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual   and Action Representations

Puhao Li; Tengyu Liu; Yuyang Li; Muzhi Han; Haoran Geng; Shu Wang,; Yixin Zhu; Song-Chun Zhu; Siyuan Huang

arXiv:2404.17521·cs.RO·April 29, 2024

Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

Puhao Li, Tengyu Liu, Yuyang Li, Muzhi Han, Haoran Geng, Shu Wang,, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

PDF

Open Access 1 Repo

TL;DR

Ag2Manip introduces agent-agnostic visual and action representations to improve robotic manipulation learning, achieving significant performance gains in simulation and real-world tasks without domain-specific demonstrations.

Contribution

The paper presents a novel framework with agent-agnostic visual and action representations that enhance generalization and performance in robotic manipulation tasks.

Findings

01

325% performance increase in simulated benchmarks

02

Imitation learning success rate improved from 50% to 77.5% in real-world tests

03

Effective across both simulated and physical environments

Abstract

Autonomous robotic systems capable of learning novel manipulation tasks are poised to transform industries from manufacturing to service automation. However, modern methods (e.g., VIP and R3M) still face significant hurdles, notably the domain gap among robotic embodiments and the sparsity of successful task executions within specific action spaces, resulting in misaligned and ambiguous task representations. We introduce Ag2Manip (Agent-Agnostic representations for Manipulation), a framework aimed at surmounting these challenges through two key innovations: a novel agent-agnostic visual representation derived from human manipulation videos, with the specifics of embodiments obscured to enhance generalizability; and an agent-agnostic action representation abstracting a robot's kinematics to a universal agent proxy, emphasizing crucial interactions between end-effector and object.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Xiaoyao-Li/Ag2Manip
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation

Methodstravel james