Teaching Robots Like Dogs: Learning Agile Navigation from Luring, Gesture, and Speech
Taerim Yoon, Dongho Kang, Jin Cheng, Fatemeh Zargarbashi, Yijiang Huang, Minsung Ahn, Stelian Coros, and Sungjoon Choi

TL;DR
This paper presents a data-efficient, multimodal human-in-the-loop framework enabling legged robots to learn agile navigation from minimal demonstrations, guided by natural human gestures and speech.
Contribution
It introduces a novel framework combining simulation, data aggregation, and adaptive goal cueing for efficient robot navigation learning from limited human demonstrations.
Findings
Achieved 97.15% success rate in real-world navigation tasks.
Learned from less than 1 hour of demonstration data.
Successfully handled complex scenarios like obstacle jumping.
Abstract
In this work, we aim to enable legged robots to learn how to interpret human social cues and produce appropriate behaviors through physical human guidance. However, learning through physical engagement can place a heavy burden on users when the process requires large amounts of human-provided data. To address this, we propose a human-in-the-loop framework that enables robots to acquire navigational behaviors in a data-efficient manner and to be controlled via multimodal natural human inputs, specifically gestural and verbal commands. We reconstruct interaction scenes using a physics-based simulation and aggregate data to mitigate distributional shifts arising from limited demonstration data. Our progressive goal cueing strategy adaptively feeds appropriate commands and navigation goals during training, leading to more accurate navigation and stronger alignment between human input and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Robotic Locomotion and Control · Human Motion and Animation
