Teaching Robots Like Dogs: Learning Agile Navigation from Luring, Gesture, and Speech

Taerim Yoon; Dongho Kang; Jin Cheng; Fatemeh Zargarbashi; Yijiang Huang; Minsung Ahn; Stelian Coros; and Sungjoon Choi

arXiv:2601.08422·cs.RO·January 22, 2026

Teaching Robots Like Dogs: Learning Agile Navigation from Luring, Gesture, and Speech

Taerim Yoon, Dongho Kang, Jin Cheng, Fatemeh Zargarbashi, Yijiang Huang, Minsung Ahn, Stelian Coros, and Sungjoon Choi

PDF

Open Access

TL;DR

This paper presents a data-efficient, multimodal human-in-the-loop framework enabling legged robots to learn agile navigation from minimal demonstrations, guided by natural human gestures and speech.

Contribution

It introduces a novel framework combining simulation, data aggregation, and adaptive goal cueing for efficient robot navigation learning from limited human demonstrations.

Findings

01

Achieved 97.15% success rate in real-world navigation tasks.

02

Learned from less than 1 hour of demonstration data.

03

Successfully handled complex scenarios like obstacle jumping.

Abstract

In this work, we aim to enable legged robots to learn how to interpret human social cues and produce appropriate behaviors through physical human guidance. However, learning through physical engagement can place a heavy burden on users when the process requires large amounts of human-provided data. To address this, we propose a human-in-the-loop framework that enables robots to acquire navigational behaviors in a data-efficient manner and to be controlled via multimodal natural human inputs, specifically gestural and verbal commands. We reconstruct interaction scenes using a physics-based simulation and aggregate data to mitigate distributional shifts arising from limited demonstration data. Our progressive goal cueing strategy adaptively feeds appropriate commands and navigation goals during training, leading to more accurate navigation and stronger alignment between human input and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Robotic Locomotion and Control · Human Motion and Animation