Learning Implicit Social Navigation Behavior using Deep Inverse   Reinforcement Learning

Tribhi Kathuria; Ke Liu; Junwoo Jang; X. Jessie Yang; and Maani; Ghaffari

arXiv:2501.06946·cs.RO·January 14, 2025

Learning Implicit Social Navigation Behavior using Deep Inverse Reinforcement Learning

Tribhi Kathuria, Ke Liu, Junwoo Jang, X. Jessie Yang, and Maani, Ghaffari

PDF

TL;DR

This paper introduces S-MEDIRL, a deep inverse reinforcement learning algorithm that enables robots to learn social navigation behaviors by predicting cost maps from trajectory data and scene geometry, outperforming rule-based methods.

Contribution

The paper presents a novel deep inverse reinforcement learning approach for social navigation that extrapolates beyond demonstrations to encode scene navigability and social behaviors.

Findings

01

Robot learns to yield and avoid deadlocks in simulation

02

S-MEDIRL outperforms ORCA and rule-based agents

03

Effective in narrow crossing scenarios

Abstract

This paper reports on learning a reward map for social navigation in dynamic environments where the robot can reason about its path at any time, given agents' trajectories and scene geometry. Humans navigating in dense and dynamic indoor environments often work with several implied social rules. A rule-based approach fails to model all possible interactions between humans, robots, and scenes. We propose a novel Smooth Maximum Entropy Deep Inverse Reinforcement Learning (S-MEDIRL) algorithm that can extrapolate beyond expert demos to better encode scene navigability from few-shot demonstrations. The agent learns to predict the cost maps reasoning on trajectory data and scene geometry. The agent samples a trajectory that is then executed using a local crowd navigation controller. We present results in a photo-realistic simulation environment, with a robot and a human navigating a narrow…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.