Self-Supervised Relation Alignment for Scene Graph Generation
Bicheng Xu, Renjie Liao, Leonid Sigal

TL;DR
This paper introduces a self-supervised relational alignment method for scene graph generation that enhances existing models by aligning masked relation predictions with supervised ones through distillation.
Contribution
It proposes a novel self-supervised regularization technique that can be integrated with any scene graph generation framework to improve performance.
Findings
Significant performance improvements on benchmark datasets.
Effective when combined with SGTR and Neural Motifs architectures.
Enhances relation prediction accuracy through relational alignment.
Abstract
The goal of scene graph generation is to predict a graph from an input image, where nodes correspond to identified and localized objects and edges to their corresponding interaction predicates. Existing methods are trained in a fully supervised manner and focus on message passing mechanisms, loss functions, and/or bias mitigation. In this work we introduce a simple-yet-effective self-supervised relational alignment regularization designed to improve the scene graph generation performance. The proposed alignment is general and can be combined with any existing scene graph generation framework, where it is trained alongside the original model's objective. The alignment is achieved through distillation, where an auxiliary relation prediction branch, that mirrors and shares parameters with the supervised counterpart, is designed. In the auxiliary branch, relational input features are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Self-Supervised Relation Alignment for Scene Graph Generation· youtube
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Graph Neural Networks · Topic Modeling
