FDSG: Forecasting Dynamic Scene Graphs

Yi Yang; Yuren Cong; Hao Cheng; Bodo Rosenhahn; Michael Ying Yang

arXiv:2506.01487·cs.CV·July 21, 2025

FDSG: Forecasting Dynamic Scene Graphs

Yi Yang, Yuren Cong, Hao Cheng, Bodo Rosenhahn, Michael Ying Yang

PDF

TL;DR

FDSG is a novel framework that predicts future scene graphs in videos by modeling entity and relationship dynamics, outperforming existing methods on dynamic scene graph tasks.

Contribution

The paper introduces FDSG, a new approach that forecasts future scene graphs in videos using query decomposition and neural stochastic differential equations.

Findings

01

FDSG outperforms state-of-the-art methods on scene graph forecasting tasks.

02

Introduces Scene Graph Forecasting, a new benchmark for future scene graph prediction.

03

Effectively models entity and relationship dynamics over time.

Abstract

Dynamic scene graph generation extends scene graph generation from images to videos by modeling entity relationships and their temporal evolution. However, existing methods either generate scene graphs from observed frames without explicitly modeling temporal dynamics, or predict only relationships while assuming static entity labels and locations. These limitations hinder effective extrapolation of both entity and relationship dynamics, restricting video scene understanding. We propose Forecasting Dynamic Scene Graphs (FDSG), a novel framework that predicts future entity labels, bounding boxes, and relationships, for unobserved frames, while also generating scene graphs for observed frames. Our scene graph forecast module leverages query decomposition and neural stochastic differential equations to model entity and relationship dynamics. A temporal aggregation module further refines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.