Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning
Liu Qiyuan

TL;DR
This paper introduces a lightweight reinforcement learning approach that uses 3D human pose estimation to enable robotic manipulators to imitate human arm motions from minimal video data, reducing the need for extensive expert datasets.
Contribution
It combines 3D pose estimation with reinforcement learning to simplify motion imitation into a joint angle prediction task, requiring only seconds of video for training.
Findings
Effective imitation of human arm motions from brief videos
Strong generalization to unfamiliar videos
Reduces reliance on costly motion capture data
Abstract
The existing Motion Imitation models typically require expert data obtained through MoCap devices, but the vast amount of training data needed is difficult to acquire, necessitating substantial investments of financial resources, manpower, and time. This project combines 3D human pose estimation with reinforcement learning, proposing a novel model that simplifies Motion Imitation into a prediction problem of joint angle values in reinforcement learning. This significantly reduces the reliance on vast amounts of training data, enabling the agent to learn an imitation policy from just a few seconds of video and exhibit strong generalization capabilities. It can quickly apply the learned policy to imitate human arm motions in unfamiliar videos. The model first extracts skeletal motions of human arms from a given video using 3D human pose estimation. These extracted arm motions are then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Reinforcement Learning in Robotics
