LISN: Language-Instructed Social Navigation with VLM-based Controller Modulating
Junting Chen, Yunchuan Li, Panfeng Jiang, Jiacheng Du, Zixuan Chen, Chenrui Tie, Jiajun Deng, Lin Shao

TL;DR
This paper introduces LISN-Bench, a new simulation benchmark for language-guided social navigation, and proposes Social-Nav-Modulator, a hierarchical system that improves robot navigation by integrating language instructions with scene understanding.
Contribution
It presents the first benchmark for language-instructed social navigation and a hierarchical VLM-based controller that enhances dynamic avoidance and instruction following.
Findings
Achieved 91.3% success rate in social navigation tasks
Significantly outperformed baseline methods by over 63%
Improved performance in complex scenarios like crowd following and forbidden region avoidance
Abstract
Towards human-robot coexistence, socially aware navigation is significant for mobile robots. Yet existing studies on this area focus mainly on path efficiency and pedestrian collision avoidance, which are essential but represent only a fraction of social navigation. Beyond these basics, robots must also comply with user instructions, aligning their actions to task goals and social norms expressed by humans. In this work, we present LISN-Bench, the first simulation-based benchmark for language-instructed social navigation. Built on Rosnav-Arena 3.0, it is the first standardized social navigation benchmark to incorporate instruction following and scene understanding across diverse contexts. To address this task, we further propose Social-Nav-Modulator, a fast-slow hierarchical system where a VLM agent modulates costmaps and controller parameters. Decoupling low-level action generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Social Robot Interaction and HRI · Autonomous Vehicle Technology and Safety
