Visual IRL for Human-Like Robotic Manipulation
Ehsan Asali, Prashant Doshi

TL;DR
This paper introduces Visual IRL, a method enabling robots to learn human-like manipulation tasks from observation by mapping human kinematics to robot actions, enhancing naturalness and efficiency in industrial settings.
Contribution
The paper presents a novel neuro-symbolic dynamics model and a Visual IRL framework that transfer human manipulation skills to robots using RGB-D keypoints and inverse reinforcement learning.
Findings
Effective transfer of human manipulation skills to robots.
Improved human-robot compatibility in manufacturing tasks.
Successful demonstration on onion processing and liquid pouring tasks.
Abstract
We present a novel method for collaborative robots (cobots) to learn manipulation tasks and perform them in a human-like manner. Our method falls under the learn-from-observation (LfO) paradigm, where robots learn to perform tasks by observing human actions, which facilitates quicker integration into industrial settings compared to programming from scratch. We introduce Visual IRL that uses the RGB-D keypoints in each frame of the observed human task performance directly as state features, which are input to inverse reinforcement learning (IRL). The inversely learned reward function, which maps keypoints to reward values, is transferred from the human to the cobot using a novel neuro-symbolic dynamics model, which maps human kinematics to the cobot arm. This model allows similar end-effector positioning while minimizing joint adjustments, aiming to preserve the natural dynamics of human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Automated Systems
MethodsFocus
