TL;DR
Kimera introduces a novel 3D Dynamic Scene Graph representation and a fully automatic method to build it from visual-inertial data, enhancing robotic scene understanding and enabling real-time semantic path planning.
Contribution
The paper presents Kimera, the first fully automatic system to construct 3D Dynamic Scene Graphs from visual-inertial data, integrating SLAM, semantic mapping, and scene understanding.
Findings
Kimera achieves state-of-the-art visual-inertial SLAM performance.
It accurately estimates 3D metric-semantic meshes in real-time.
It constructs complex indoor environment DSGs with many objects and humans in minutes.
Abstract
Humans are able to form a complex mental model of the environment they move in. This mental model captures geometric and semantic aspects of the scene, describes the environment at multiple levels of abstractions (e.g., objects, rooms, buildings), includes static and dynamic entities and their relations (e.g., a person is in a room at a given time). In contrast, current robots' internal representations still provide a partial and fragmented understanding of the environment, either in the form of a sparse or dense set of geometric primitives (e.g., points, lines, planes, voxels) or as a collection of objects. This paper attempts to reduce the gap between robot and human perception by introducing a novel representation, a 3D Dynamic Scene Graph(DSG), that seamlessly captures metric and semantic aspects of a dynamic environment. A DSG is a layered graph where nodes represent spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsWebull Customer Care Number +1-833-534-1729
