Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors
Julian Hau, Simon Bultmann, Sven Behnke

TL;DR
This paper presents a system that enhances 3D semantic mapping with object-level details using smart edge sensors, enabling real-time, accurate object pose estimation and tracking for improved scene understanding.
Contribution
It introduces a novel object-level mapping approach with pose estimation and tracking in a distributed sensor network, advancing scene understanding capabilities.
Findings
Pose estimation accuracy within a few centimeters.
Real-time object tracking under high occlusion.
Effective integration of object models into semantic maps.
Abstract
Autonomous robots that interact with their environment require a detailed semantic scene model. For this, volumetric semantic maps are frequently used. The scene understanding can further be improved by including object-level information in the map. In this work, we extend a multi-view 3D semantic mapping system consisting of a network of distributed smart edge sensors with object-level information, to enable downstream tasks that need object-level input. Objects are represented in the map via their 3D mesh model or as an object-centric volumetric sub-map that can model arbitrary object geometry when no detailed 3D model is available. We propose a keypoint-based approach to estimate object poses via PnP and refinement via ICP alignment of the 3D object model with the observed point cloud segments. Object instances are tracked to integrate observations over time and to be robust against…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Advanced Vision and Imaging
MethodsPnP
