Time is on my sight: scene graph filtering for dynamic environment   perception in an LLM-driven robot

Simone Colombani; Luca Brini; Dimitri Ognibene; Giuseppe Boccignone

arXiv:2411.15027·cs.RO·November 25, 2024

Time is on my sight: scene graph filtering for dynamic environment perception in an LLM-driven robot

Simone Colombani, Luca Brini, Dimitri Ognibene, Giuseppe Boccignone

PDF

Open Access

TL;DR

This paper introduces a robot control system that uses Large Language Models and dynamic scene graphs to enable robots to adaptively perceive, interpret, and act in complex, changing environments for improved human-robot interaction.

Contribution

It presents a novel architecture combining LLMs, semantic scene graphs, and particle filtering for real-time perception and adaptive planning in dynamic environments.

Findings

01

Effective real-time scene graph updating from RGB-D data

02

Enhanced robot adaptability and task efficiency

03

Improved human-robot collaboration in dynamic settings

Abstract

Robots are increasingly being used in dynamic environments like workplaces, hospitals, and homes. As a result, interactions with robots must be simple and intuitive, with robots perception adapting efficiently to human-induced changes. This paper presents a robot control architecture that addresses key challenges in human-robot interaction, with a particular focus on the dynamic creation and continuous update of the robot state representation. The architecture uses Large Language Models to integrate diverse information sources, including natural language commands, robotic skills representation, real-time dynamic semantic mapping of the perceived scene. This enables flexible and adaptive robotic behavior in complex, dynamic environments. Traditional robotic systems often rely on static, pre-programmed instructions and settings, limiting their adaptability to dynamic environments and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods

MethodsFocus