TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable   Industrial Robotics Through Large/Vision Language Models

Ammar N. Abbas; Csaba Beleznai

arXiv:2412.15462·cs.RO·December 23, 2024

TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models

Ammar N. Abbas, Csaba Beleznai

PDF

TL;DR

This paper explores integrating Large Language Models and Vision Language Models with industrial robots to improve interpretability, safety, and natural language command execution in human-robot interactions.

Contribution

It introduces four LLM-assisted robotic control workflows that enhance understanding, safety, and task planning through natural language and visual inputs.

Findings

01

Successful simulation of low-level control with LLMs

02

Generation of human-readable feedback on robot states

03

Use of visual and structural info for task planning

Abstract

TalkWithMachines aims to enhance human-robot interaction by contributing to interpretable industrial robotic systems, especially for safety-critical applications. The presented paper investigates recent advancements in Large Language Models (LLMs) and Vision Language Models (VLMs), in combination with robotic perception and control. This integration allows robots to understand and execute commands given in natural language and to perceive their environment through visual and/or descriptive inputs. Moreover, translating the LLM's internal states and reasoning into text that humans can easily understand ensures that operators gain a clearer insight into the robot's current state and intentions, which is essential for effective and safe operation. Our paper outlines four LLM-assisted simulated robotic control workflows, which explore (i) low-level control, (ii) the generation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training