EmbodiedLGR: Integrating Lightweight Graph Representation and Retrieval for Semantic-Spatial Memory in Robotic Agents
Paolo Riva, Leonardo Gargani, Matteo Frosi, Matteo Matteucci

TL;DR
EmbodiedLGR is a novel robot memory system combining lightweight graph representations with retrieval methods, enabling efficient environment understanding and real-time human-robot interaction.
Contribution
The paper introduces EmbodiedLGR, a hybrid visual-language model architecture that improves memory efficiency and inference speed for embodied robotic agents.
Findings
Achieves state-of-the-art inference and query times on NaVQA dataset.
Retains competitive accuracy compared to current approaches.
Successfully deployed on a physical robot for real-world human-robot interaction.
Abstract
As the world of agentic artificial intelligence applied to robotics evolves, the need for agents capable of building and retrieving memories and observations efficiently is increasing. Robots operating in complex environments must build memory structures to enable useful human-robot interactions by leveraging the mnemonic representation of the current operating context. People interacting with robots may expect the embodied agent to provide information about locations, events, or objects, which requires the agent to provide precise answers within human-like inference times to be perceived as responsive. We propose the Embodied Light Graph Retrieval Agent (EmbodiedLGR-Agent), a visual-language model (VLM)-driven agent architecture that constructs dense and efficient representations of robot operating environments. EmbodiedLGR-Agent directly addresses the need for an efficient memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
