3D VSG: Long-term Semantic Scene Change Prediction through 3D Variable Scene Graphs
Samuel Looper, Javier Rodriguez-Puigvert, Roland Siegwart, Cesar, Cadena, and Lukas Schmid

TL;DR
This paper introduces Variable Scene Graphs (VSG) for modeling and predicting long-term semantic scene changes, enabling robots to better understand and adapt to dynamic environments over time.
Contribution
The work formalizes semantic scene variability estimation and proposes DeltaVSG, a novel supervised method for predicting scene change likelihoods in 3D environments.
Findings
DeltaVSG achieves 77.1% accuracy in variability estimation.
The method improves change detection speed by 66%.
VSG prediction aligns well with human intuition.
Abstract
Numerous applications require robots to operate in environments shared with other agents, such as humans or other robots. However, such shared scenes are typically subject to different kinds of long-term semantic scene changes. The ability to model and predict such changes is thus crucial for robot autonomy. In this work, we formalize the task of semantic scene variability estimation and identify three main varieties of semantic scene change: changes in the position of an object, its semantic state, or the composition of a scene as a whole. To represent this variability, we propose the Variable Scene Graph (VSG), which augments existing 3D Scene Graph (SG) representations with the variability attribute, representing the likelihood of discrete long-term change events. We present a novel method, DeltaVSG, to estimate the variability of VSGs in a supervised fashion. We evaluate our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Robotics and Sensor-Based Localization
