Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation

Tharun Sethuraman; Subham Agrawal; Nils Dengler; Jorge de Heuvel; Teena Hassan; Maren Bennewitz

arXiv:2603.17510·cs.RO·May 13, 2026

Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation

Tharun Sethuraman, Subham Agrawal, Nils Dengler, Jorge de Heuvel, Teena Hassan, Maren Bennewitz

PDF

TL;DR

This paper introduces a pipeline that enables robots to interpret and adapt to human preferences in real-time navigation by combining vision-language models, large language models, and multi-objective reinforcement learning.

Contribution

It presents a novel integration of semantic reasoning and low-level control allowing robots to understand natural language preferences in context and adapt their navigation accordingly.

Findings

01

System reliably captures user intent and preferences.

02

Enables real-time, context-dependent navigation adaptation.

03

Improves robot transparency and usability in human environments.

Abstract

Robots operating in human-shared environments must not only achieve task-level navigation objectives such as safety and efficiency, but also adapt their behavior to human preferences. However, as human preferences are typically expressed in natural language and depend on environmental context, it is difficult to directly integrate them into low-level robot control policies. In this work, we present a pipeline that enables robots to understand and apply context-dependent navigation preferences by combining foundational models with a Multi-Objective Reinforcement Learning (MORL) navigation policy. Thus, our approach integrates high-level semantic reasoning with low-level motion control. A Vision-Language Model (VLM) extracts structured environmental context from onboard visual observations, while Large Language Models (LLM) convert natural language user feedback into interpretable,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.