Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

Thong Thanh Nguyen; Xiaobao Wu; Yi Bin; Cong-Duy T Nguyen; See-Kiong Ng; Anh Tuan Luu

arXiv:2412.07160·cs.CV·April 28, 2026

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

Thong Thanh Nguyen, Xiaobao Wu, Yi Bin, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

PDF

1 Repo

TL;DR

This paper proposes a motion-aware contrastive learning framework to improve temporal scene graph generation by emphasizing motion patterns, outperforming existing methods on video and 4D datasets.

Contribution

It introduces a novel contrastive learning approach that leverages motion patterns to enhance temporal scene graph generation, addressing limitations of previous methods.

Findings

01

Significant performance improvements on video datasets.

02

Effective utilization of motion patterns in scene graph generation.

03

Outperforms state-of-the-art methods on multiple datasets.

Abstract

To equip artificial intelligence with a comprehensive understanding towards a temporal world, video and 4D panoptic scene graph generation abstracts visual data into nodes to represent entities and edges to capture temporal relations. Existing methods encode entity masks tracked across temporal dimensions (mask tubes), then predict their relations with temporal pooling operation, which does not fully utilize the motion indicative of the entities' relation. To overcome this limitation, we introduce a contrastive representation learning framework that focuses on motion pattern for temporal scene graph generation. Firstly, our framework encourages the model to learn close representations for mask tubes of similar subject-relation-object triplets. Secondly, we seek to push apart mask tubes from their temporally shuffled versions. Moreover, we also learn distant representations for mask…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nguyentthong/motion-contrastive-sgg
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.