Sample-Efficient Training of Robotic Guide Using Human Path Prediction   Network

Hee-Seung Moon; Jiwon Seo

arXiv:2008.05054·cs.RO·September 29, 2022

Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network

Hee-Seung Moon, Jiwon Seo

PDF

Open Access

TL;DR

This paper introduces a human path prediction network and an evolution-strategy-based training method that enables efficient training of a robotic guide for visually impaired people using virtual human trajectories, reducing the need for extensive real-world data.

Contribution

It presents a novel human path prediction network combined with a virtual training approach, significantly reducing data requirements for robot training in human-interactive scenarios.

Findings

01

Trained the HPPN with only 1.5K real episodes to generate 100K virtual episodes.

02

The robot successfully guided blindfolded participants along a target path.

03

Virtual episodes enabled exploration of new reward designs prioritizing human comfort.

Abstract

Training a robot that engages with people is challenging; it is expensive to directly involve people in the training process, which requires numerous data samples. This paper presents an alternative approach for resolving this problem. We propose a human path prediction network (HPPN) that generates a user's future trajectory based on sequential robot actions and human responses using a recurrent-neural-network structure. Subsequently, an evolution-strategy-based robot training method using only the virtual human movements generated using the HPPN is presented. It is demonstrated that our proposed method permits sample-efficient training of a robotic guide for visually impaired people. By collecting only 1.5 K episodes from real users, we were able to train the HPPN and generate more than 100 K virtual episodes required for training the robot. The trained robot precisely guided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Hand Gesture Recognition Systems · Robotics and Automated Systems