Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots

Dan BW Choe; Sundhar Vinodh Sangeetha; Steven Emanuel; Chih-Yuan Chiu; Samuel Coogan; Shreyas Kousik

arXiv:2505.13376·cs.RO·May 20, 2025

Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots

Dan BW Choe, Sundhar Vinodh Sangeetha, Steven Emanuel, Chih-Yuan Chiu, Samuel Coogan, Shreyas Kousik

PDF

Open Access

TL;DR

This paper introduces a decentralized framework enabling heterogeneous robots to collaboratively detect conflicts, request help via natural language, and reason over offers to optimize overall system efficiency using LLMs and formal logic grounded in MILP.

Contribution

It presents a novel decentralized approach combining vision, language, and formal logic for cooperative robot assistance, ensuring syntactic correctness and system-wide optimization.

Findings

01

Robots can effectively detect conflicts using VLM.

02

The framework enables natural language help requests and offers.

03

Considering multiple help offers improves overall system efficiency.

Abstract

Increased robot deployment, such as in warehousing, has revealed a need for seamless collaboration among heterogeneous robot teams to resolve unforeseen conflicts. To address this challenge, we propose a novel, decentralized framework for robots to request and provide help. The framework begins with robots detecting conflicts using a Vision Language Model (VLM), then reasoning over whether help is needed. If so, it crafts and broadcasts a natural language (NL) help request using a Large Language Model (LLM). Potential helper robots reason over the request and offer help (if able), along with information about impact to their current tasks. Helper reasoning is implemented via an LLM grounded in Signal Temporal Logic (STL) using a Backus-Naur Form (BNF) grammar to guarantee syntactically valid NL-to-STL translations, which are then solved as a Mixed Integer Linear Program (MILP). Finally,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Social Robot Interaction and HRI · Reinforcement Learning in Robotics