Human-Inspired Multi-Agent Navigation using Knowledge Distillation

Pei Xu; Ioannis Karamouzas

arXiv:2103.10000·cs.RO·August 30, 2023

Human-Inspired Multi-Agent Navigation using Knowledge Distillation

Pei Xu, Ioannis Karamouzas

PDF

1 Repo

TL;DR

This paper introduces a novel framework for multi-agent navigation that employs knowledge distillation and reinforcement learning to enable agents to mimic human-like collision avoidance and goal-directed behaviors, surpassing previous methods.

Contribution

The paper presents a new approach combining knowledge distillation with reinforcement learning to teach agents human-like navigation skills in decentralized multi-agent environments.

Findings

01

Agents exhibit human-like trajectories in collision avoidance tasks.

02

Our method outperforms expert policies and non-distilled learning agents.

03

Agents generalize to new tasks not seen in demonstrations.

Abstract

Despite significant advancements in the field of multi-agent navigation, agents still lack the sophistication and intelligence that humans exhibit in multi-agent settings. In this paper, we propose a framework for learning a human-like general collision avoidance policy for agent-agent interactions in fully decentralized, multi-agent environments. Our approach uses knowledge distillation with reinforcement learning to shape the reward function based on expert policies extracted from human trajectory demonstrations through behavior cloning. We show that agents trained with our approach can take human-like trajectories in collision avoidance and goal-directed steering tasks not provided by the demonstrations, outperforming the experts as well as learning-based agents trained without knowledge distillation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xupei0610/KDMA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation