One to rule them all: natural language to bind communication, perception and action
Simone Colombani, Dimitri Ognibene, Giuseppe Boccignone

TL;DR
This paper introduces an advanced robotic architecture that integrates natural language understanding, perception, and planning using Large Language Models within a dynamic feedback system for improved human-robot interaction.
Contribution
It presents a novel architecture combining LLMs with perception and planning modules, enabling robots to interpret natural language commands and adapt plans in real-time environments.
Findings
Enhanced robot adaptability in dynamic environments
Effective natural language command translation
Real-time plan adjustment based on feedback
Abstract
In recent years, research in the area of human-robot interaction has focused on developing robots capable of understanding complex human instructions and performing tasks in dynamic and diverse environments. These systems have a wide range of applications, from personal assistance to industrial robotics, emphasizing the importance of robots interacting flexibly, naturally and safely with humans. This paper presents an advanced architecture for robotic action planning that integrates communication, perception, and planning with Large Language Models (LLMs). Our system is designed to translate commands expressed in natural language into executable robot actions, incorporating environmental information and dynamically updating plans based on real-time feedback. The Planner Module is the core of the system where LLMs embedded in a modified ReAct framework are employed to interpret and carry…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage, Metaphor, and Cognition
