Training Robots without Robots: Deep Imitation Learning for Master-to-Robot Policy Transfer
Heecheol Kim, Yoshiyuki Ohmura, Akihiko Nagakubo, and Yasuo Kuniyoshi

TL;DR
This paper introduces a novel master-to-robot policy transfer system that enables force feedback-based manipulation learning without using actual robots for demonstration, utilizing gaze-based imitation and Transformer models.
Contribution
The study presents a new M2R system that bypasses traditional robot demonstrations, employing a human-controlled device with force sensing and advanced inference techniques.
Findings
Successfully transferred manipulation policies without robot demonstrations.
Achieved effective force feedback learning in a bottle-cap-opening task.
Overcame domain gaps using gaze-based imitation and calibration.
Abstract
Deep imitation learning is promising for robot manipulation because it only requires demonstration samples. In this study, deep imitation learning is applied to tasks that require force feedback. However, existing demonstration methods have deficiencies; bilateral teleoperation requires a complex control scheme and is expensive, and kinesthetic teaching suffers from visual distractions from human intervention. This research proposes a new master-to-robot (M2R) policy transfer system that does not require robots for teaching force feedback-based manipulation tasks. The human directly demonstrates a task using a controller. This controller resembles the kinematic parameters of the robot arm and uses the same end-effector with force/torque (F/T) sensors to measure the force feedback. Using this controller, the operator can feel force feedback without a bilateral system. The proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
