OVITA: Open-Vocabulary Interpretable Trajectory Adaptations

Anurag Maurya; Tashmoy Ghosh; Anh Nguyen; Ravi Prakash

arXiv:2508.17260·cs.RO·August 26, 2025

TL;DR

OVITA is a framework that uses pre-trained large language models to adapt robot trajectories from natural-language instructions. It generates code that modifies individual waypoints and explains that code to the user, so non-experts can adjust robot behavior in dynamic, unstructured environments.

Contribution

The paper introduces OVITA, a framework that combines multiple pre-trained LLMs for open-vocabulary, language-driven trajectory adaptation. One LLM generates code that serves as the adaptation policy over individual waypoints; another acts as a code explainer, so users can follow the changes without expert knowledge.

Findings

01 Effective in simulation and real-world tasks

02 Supports diverse robotic platforms including manipulators and drones

03 Enables intuitive, natural language-based trajectory modifications

Abstract

Adapting trajectories to dynamic situations and user preferences is crucial for robot operation in unstructured environments with non-expert users. Natural language enables users to express these adjustments in an interactive manner. We introduce OVITA, an interpretable, open-vocabulary, language-driven framework designed for adapting robot trajectories in dynamic and novel situations based on human instructions. OVITA leverages multiple pre-trained Large Language Models (LLMs) to integrate user commands into trajectories generated by motion planners or those learned through demonstrations. OVITA employs code as an adaptation policy generated by an LLM, enabling users to adjust individual waypoints, thus providing flexible control. Another LLM, which acts as a code explainer, removes the need for expert users, enabling intuitive interactions. The efficacy and significance of the…
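
The "code as an adaptation policy" idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: the `(x, y, z, speed)` waypoint layout, the `adapt` and `load_policy` names, the example instruction, and the hard-coded policy string that stands in for real LLM output.

```python
from typing import Callable, List, Tuple

# Hypothetical waypoint layout: (x, y, z, speed). The paper's format may differ.
Waypoint = Tuple[float, float, float, float]

# Stand-in for code an LLM might return for an instruction such as
# "keep at least 0.5 above the table and slow down in the second half".
GENERATED_POLICY = '''
def adapt(trajectory):
    n = len(trajectory)
    adapted = []
    for i, (x, y, z, speed) in enumerate(trajectory):
        z = max(z, 0.5)            # raise any waypoint below the height limit
        if i >= n // 2:
            speed = speed * 0.5    # halve the speed over the second half
        adapted.append((x, y, z, speed))
    return adapted
'''


def load_policy(source: str) -> Callable[[List[Waypoint]], List[Waypoint]]:
    """Execute policy source text and return the `adapt` function it defines.

    Running exec on untrusted text is unsafe; a real system would need
    sandboxing and validation before executing generated code.
    """
    namespace: dict = {}
    exec(source, namespace)
    return namespace["adapt"]


# Original trajectory, e.g. from a motion planner or a demonstration.
original: List[Waypoint] = [
    (0.0, 0.0, 0.2, 1.0),
    (0.1, 0.0, 0.4, 1.0),
    (0.2, 0.0, 0.6, 1.0),
    (0.3, 0.0, 0.3, 1.0),
]

adapt = load_policy(GENERATED_POLICY)
adapted = adapt(original)

# Show each waypoint before and after adaptation.
for before, after in zip(original, adapted):
    print(before, "->", after)
```

Because the policy is ordinary code operating on each waypoint, the change it makes can be read, explained, and corrected line by line, which is the kind of interpretability the abstract attributes to the code-explainer step.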
