Time is on my sight: scene graph filtering for dynamic environment perception in an LLM-driven robot
Simone Colombani, Luca Brini, Dimitri Ognibene, Giuseppe Boccignone

TL;DR
This paper introduces a robot control system that uses Large Language Models and dynamic scene graphs to enable robots to adaptively perceive, interpret, and act in complex, changing environments for improved human-robot interaction.
Contribution
It presents a novel architecture combining LLMs, semantic scene graphs, and particle filtering for real-time perception and adaptive planning in dynamic environments.
Findings
Effective real-time scene graph updating from RGB-D data
Enhanced robot adaptability and task efficiency
Improved human-robot collaboration in dynamic settings
Abstract
Robots are increasingly being used in dynamic environments like workplaces, hospitals, and homes. As a result, interactions with robots must be simple and intuitive, with robots perception adapting efficiently to human-induced changes. This paper presents a robot control architecture that addresses key challenges in human-robot interaction, with a particular focus on the dynamic creation and continuous update of the robot state representation. The architecture uses Large Language Models to integrate diverse information sources, including natural language commands, robotic skills representation, real-time dynamic semantic mapping of the perceived scene. This enables flexible and adaptive robotic behavior in complex, dynamic environments. Traditional robotic systems often rely on static, pre-programmed instructions and settings, limiting their adaptability to dynamic environments and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods
MethodsFocus
