Sample-Efficient Training of Robotic Guide Using Human Path Prediction Network
Hee-Seung Moon, Jiwon Seo

TL;DR
This paper introduces a human path prediction network and an evolution-strategy-based training method that enables efficient training of a robotic guide for visually impaired people using virtual human trajectories, reducing the need for extensive real-world data.
Contribution
It presents a novel human path prediction network combined with a virtual training approach, significantly reducing data requirements for robot training in human-interactive scenarios.
Findings
Trained the HPPN with only 1.5K real episodes to generate 100K virtual episodes.
The robot successfully guided blindfolded participants along a target path.
Virtual episodes enabled exploration of new reward designs prioritizing human comfort.
Abstract
Training a robot that engages with people is challenging; it is expensive to directly involve people in the training process, which requires numerous data samples. This paper presents an alternative approach for resolving this problem. We propose a human path prediction network (HPPN) that generates a user's future trajectory based on sequential robot actions and human responses using a recurrent-neural-network structure. Subsequently, an evolution-strategy-based robot training method using only the virtual human movements generated using the HPPN is presented. It is demonstrated that our proposed method permits sample-efficient training of a robotic guide for visually impaired people. By collecting only 1.5 K episodes from real users, we were able to train the HPPN and generate more than 100 K virtual episodes required for training the robot. The trained robot precisely guided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Hand Gesture Recognition Systems · Robotics and Automated Systems
