HiCrowd: Hierarchical Crowd Flow Alignment for Dense Human Environments
Yufei Zhu, Shih-Min Yang, Martin Magnusson, Allan Wang

TL;DR
HiCrowd is a hierarchical framework combining reinforcement learning and model predictive control that enables mobile robots to navigate dense crowds safely and efficiently by aligning with pedestrian flows and reducing freezing behavior.
Contribution
The paper introduces HiCrowd, a novel hierarchical approach integrating RL and MPC for crowd-aware robot navigation, leveraging human motion as guidance rather than obstacles.
Findings
Outperforms reactive and learning-based baselines in navigation efficiency.
Reduces freezing behaviors in dense crowds.
Effective in both offline and online simulation settings.
Abstract
Navigating through dense human crowds remains a significant challenge for mobile robots. A key issue is the freezing robot problem, where the robot struggles to find safe motions and becomes stuck within the crowd. To address this, we propose HiCrowd, a hierarchical framework that integrates reinforcement learning (RL) with model predictive control (MPC). HiCrowd leverages surrounding pedestrian motion as guidance, enabling the robot to align with compatible crowd flows. A high-level RL policy generates a follow point to align the robot with a suitable pedestrian group, while a low-level MPC safely tracks this guidance with short horizon planning. The method combines long-term crowd aware decision making with safe short-term execution. We evaluate HiCrowd against reactive and learning-based baselines in offline setting (replaying recorded human trajectories) and online setting (human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Evacuation and Crowd Dynamics · Reinforcement Learning in Robotics
