Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation
Yanbo Wang, Zipeng Fang, Lei Zhao, Weidong Chen

TL;DR
LE-Nav is a novel navigation framework that uses large language models and variational autoencoders to adaptively tune robot navigation parameters, improving performance and social acceptance in diverse real-world environments.
Contribution
It introduces a scene-aware, interpretable navigation system that leverages LLM reasoning and CVAE-based adaptation for zero-shot hyperparameter tuning in unstructured settings.
Findings
Achieves human-level hyperparameter tuning across various scenarios.
Outperforms state-of-the-art methods on success rate, efficiency, safety, and comfort.
Receives higher subjective scores for safety and social acceptance.
Abstract
Service robots are increasingly deployed in diverse and dynamic environments, where both physical layouts and social contexts change over time and across locations. In these unstructured settings, conventional navigation systems that rely on fixed parameters often fail to generalize across scenarios, resulting in degraded performance and reduced social acceptance. Although recent approaches have leveraged reinforcement learning to enhance traditional planners, these methods often fail in real-world deployments due to poor generalization and limited simulation diversity, which hampers effective sim-to-real transfer. To tackle these issues, we present LE-Nav, an interpretable and scene-aware navigation framework that leverages multi-modal large language model reasoning and conditional variational autoencoders to adaptively tune planner hyperparameters. To achieve zero-shot scene…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Speech and dialogue systems · AI-based Problem Solving and Planning
