Environment-Invariant Curriculum Relation Learning for Fine-Grained   Scene Graph Generation

Yukuan Min; Aming Wu; Cheng Deng

arXiv:2308.03282·cs.CV·August 22, 2023

Environment-Invariant Curriculum Relation Learning for Fine-Grained Scene Graph Generation

Yukuan Min, Aming Wu, Cheng Deng

PDF

Open Access 1 Repo

TL;DR

This paper introduces EICR, a novel method for scene graph generation that addresses both predicate and context imbalance issues by learning environment-invariant relations and applying curriculum learning, leading to improved results.

Contribution

The paper proposes a plug-and-play EICR framework that tackles subject-object imbalance and predicate imbalance in SGG through environment-invariant learning and curriculum strategies.

Findings

01

Significant performance improvements on VG and GQA datasets.

02

EICR is compatible with various existing SGG models.

03

Effectively addresses class and context imbalance issues.

Abstract

The scene graph generation (SGG) task is designed to identify the predicates based on the subject-object pairs.However,existing datasets generally include two imbalance cases: one is the class imbalance from the predicted predicates and another is the context imbalance from the given subject-object pairs, which presents significant challenges for SGG. Most existing methods focus on the imbalance of the predicted predicate while ignoring the imbalance of the subject-object pairs, which could not achieve satisfactory results. To address the two imbalance cases, we propose a novel Environment Invariant Curriculum Relation learning (EICR) method, which can be applied in a plug-and-play fashion to existing SGG methods. Concretely, to remove the imbalance of the subject-object pairs, we first construct different distribution environments for the subject-object pairs and learn a model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

myukzzz/eicr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Human Pose and Action Recognition

MethodsFocus