Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation

Yanbo Wang; Zipeng Fang; Lei Zhao; Weidong Chen

arXiv:2507.11001 · cs.RO · July 16, 2025


TL;DR

LE-Nav is a navigation framework that combines multi-modal large language model (MLLM) reasoning with conditional variational autoencoders (CVAEs) to adaptively tune planner hyperparameters, improving navigation performance and social acceptance in diverse real-world environments.

Contribution

The paper introduces a scene-aware, interpretable navigation system that uses MLLM reasoning and CVAE-based adaptation for zero-shot hyperparameter tuning in unstructured settings.

Findings

01. Achieves human-level hyperparameter tuning across various scenarios.

02. Outperforms state-of-the-art methods on success rate, efficiency, safety, and comfort.

03. Receives higher subjective scores for safety and social acceptance.

Abstract

Service robots are increasingly deployed in diverse and dynamic environments, where both physical layouts and social contexts change over time and across locations. In these unstructured settings, conventional navigation systems that rely on fixed parameters often fail to generalize across scenarios, resulting in degraded performance and reduced social acceptance. Although recent approaches have leveraged reinforcement learning to enhance traditional planners, these methods often fail in real-world deployments due to poor generalization and limited simulation diversity, which hampers effective sim-to-real transfer. To tackle these issues, we present LE-Nav, an interpretable and scene-aware navigation framework that leverages multi-modal large language model reasoning and conditional variational autoencoders to adaptively tune planner hyperparameters. To achieve zero-shot scene…
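The abstract describes conditioning a CVAE on scene understanding to produce planner hyperparameters. As a rough illustration of that inference-time flow only, the sketch below samples a latent vector from a standard normal prior, concatenates it with a scene condition vector, and decodes the result into bounded parameter values. Everything specific here is an assumption made for illustration and is not taken from the paper or its repository: the parameter names and ranges, the latent and condition dimensions, the single linear layer with random untrained weights, and the sigmoid squashing into each range.

```python
import math
import random

# Hypothetical planner hyperparameters and allowed ranges (illustrative only;
# these names and bounds are assumptions, not values from the paper).
PARAM_RANGES = {
    "max_vel_x": (0.2, 1.5),
    "inflation_radius": (0.1, 1.0),
    "goal_weight": (1.0, 50.0),
}

LATENT_DIM = 2  # assumed latent size
COND_DIM = 3    # assumed size of the scene condition vector


def make_decoder(seed=0):
    """Build one linear layer with random placeholder weights (untrained)."""
    rng = random.Random(seed)
    in_dim = LATENT_DIM + COND_DIM
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)]
               for _ in PARAM_RANGES]
    biases = [rng.uniform(-0.5, 0.5) for _ in PARAM_RANGES]
    return weights, biases


def decode(z, cond, decoder):
    """Map a latent vector and a condition vector to bounded hyperparameters."""
    weights, biases = decoder
    x = list(z) + list(cond)  # the decoder sees [z ; c]
    params = {}
    for (name, (lo, hi)), w_row, b in zip(PARAM_RANGES.items(), weights, biases):
        pre = sum(w * xi for w, xi in zip(w_row, x)) + b
        squashed = 1.0 / (1.0 + math.exp(-pre))  # sigmoid maps to (0, 1)
        params[name] = lo + (hi - lo) * squashed  # rescale into [lo, hi]
    return params


def sample_hyperparameters(cond, decoder, rng):
    """Draw z from the standard normal prior, then decode it given cond."""
    z = [rng.gauss(0.0, 1.0) for _ in range(LATENT_DIM)]
    return decode(z, cond, decoder)


if __name__ == "__main__":
    decoder = make_decoder(seed=0)
    crowded_scene = [0.9, 0.1, 0.3]  # made-up scene condition vectors
    open_scene = [0.1, 0.8, 0.3]
    print(sample_hyperparameters(crowded_scene, decoder, random.Random(1)))
    print(sample_hyperparameters(open_scene, decoder, random.Random(1)))
```

With the same random seed, the two calls share the same latent sample, so any difference in the printed values comes only from the condition vector. In a trained model the decoder weights would be learned from tuning data rather than drawn at random.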


Code & Models

Repositories

Cavendish518/LE-Nav
PyTorch · Official


Taxonomy

Topics: Semantic Web and Ontologies · Speech and dialogue systems · AI-based Problem Solving and Planning