Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray   Tracing

Lennart Niecksch; Alexander Mock; Felix Igelbrink; Thomas Wiemann,; Joachim Hertzberg

arXiv:2408.04979·cs.RO·August 12, 2024

Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing

Lennart Niecksch, Alexander Mock, Felix Igelbrink, Thomas Wiemann,, Joachim Hertzberg

PDF

Open Access

TL;DR

This paper introduces a mesh-based object tracking method using ray tracing for dynamic semantic 3D scene graphs, improving robustness and accuracy in pose estimation and scene understanding.

Contribution

The paper proposes a hybrid approach combining ray tracing with keypoint detection for robust 3D object tracking and semantic scene graph generation.

Findings

01

Enhanced robustness under occlusions

02

Accurate pose tracking of objects

03

Effective integration into semantic mapping

Abstract

In this paper, we present a novel method for 3D geometric scene graph generation using range sensors and RGB cameras. We initially detect instance-wise keypoints with a YOLOv8s model to compute 6D pose estimates of known objects by solving PnP. We use a ray tracing approach to track a geometric scene graph consisting of mesh models of object instances. In contrast to classical point-to-point matching, this leads to more robust results, especially under occlusions between objects instances. We show that using this hybrid strategy leads to robust self-localization, pre-segmentation of the range sensor data and accurate pose tracking of objects using the same environmental representation. All detected objects are integrated into a semantic scene graph. This scene graph then serves as a front end to a semantic mapping framework to allow spatial reasoning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Advanced Image and Video Retrieval Techniques