Adaptive Visual Scene Understanding: Incremental Scene Graph Generation

Naitik Khandelwal; Xiao Liu; Mengmi Zhang

arXiv:2310.01636·cs.CV·November 4, 2024

Adaptive Visual Scene Understanding: Incremental Scene Graph Generation

Naitik Khandelwal, Xiao Liu, Mengmi Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new benchmark and a novel replay-based method for continual scene graph generation, addressing the challenges of dynamic object relationships and scene understanding in evolving visual environments.

Contribution

It systematically explores continual learning in scene graph generation, proposes the RAS method leveraging scene synthesis, and provides a comprehensive benchmark for future research.

Findings

01

Existing methods struggle with continual SGG tasks.

02

The RAS approach improves efficiency and scene understanding.

03

Benchmarking reveals key challenges in continual SGG.

Abstract

Scene graph generation (SGG) analyzes images to extract meaningful information about objects and their relationships. In the dynamic visual world, it is crucial for AI systems to continuously detect new objects and establish their relationships with existing ones. Recently, numerous studies have focused on continual learning within the domains of object detection and image recognition. However, a limited amount of research focuses on a more challenging continual learning problem in SGG. This increased difficulty arises from the intricate interactions and dynamic relationships among objects, and their associated contexts. Thus, in continual learning, SGG models are often required to expand, modify, retain, and reason scene graphs within the process of adaptive visual scene understanding. To systematically explore Continual Scene Graph Generation (CSEGG), we present a comprehensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhanglab-deepneurocoglab/csegg
pytorchOfficial

Videos

Adaptive Visual Scene Understanding: Incremental Scene Graph Generation· slideslive

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques