Training Robots without Robots: Deep Imitation Learning for   Master-to-Robot Policy Transfer

Heecheol Kim; Yoshiyuki Ohmura; Akihiko Nagakubo; and Yasuo Kuniyoshi

arXiv:2202.09574·cs.RO·February 27, 2024·1 cites

Training Robots without Robots: Deep Imitation Learning for Master-to-Robot Policy Transfer

Heecheol Kim, Yoshiyuki Ohmura, Akihiko Nagakubo, and Yasuo Kuniyoshi

PDF

Open Access

TL;DR

This paper introduces a novel master-to-robot policy transfer system that enables force feedback-based manipulation learning without using actual robots for demonstration, utilizing gaze-based imitation and Transformer models.

Contribution

The study presents a new M2R system that bypasses traditional robot demonstrations, employing a human-controlled device with force sensing and advanced inference techniques.

Findings

01

Successfully transferred manipulation policies without robot demonstrations.

02

Achieved effective force feedback learning in a bottle-cap-opening task.

03

Overcame domain gaps using gaze-based imitation and calibration.

Abstract

Deep imitation learning is promising for robot manipulation because it only requires demonstration samples. In this study, deep imitation learning is applied to tasks that require force feedback. However, existing demonstration methods have deficiencies; bilateral teleoperation requires a complex control scheme and is expensive, and kinesthetic teaching suffers from visual distractions from human intervention. This research proposes a new master-to-robot (M2R) policy transfer system that does not require robots for teaching force feedback-based manipulation tasks. The human directly demonstrates a task using a controller. This controller resembles the kinematic parameters of the robot arm and uses the same end-effector with force/torque (F/T) sensors to measure the force feedback. Using this controller, the operator can feel force feedback without a bilateral system. The proposed method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics