Generative Compositional Augmentations for Scene Graph Prediction
Boris Knyazev, Harm de Vries, C\u{a}t\u{a}lina Cangea, Graham W., Taylor, Aaron Courville, Eugene Belilovsky

TL;DR
This paper introduces a method to generate synthetic scene graphs using GANs to improve the ability of models to recognize rare or unseen object relationships, enhancing compositional generalization in scene graph prediction.
Contribution
It proposes a novel approach to synthesize plausible scene graphs for rare compositions, addressing the challenge of compositional generalization in scene graph prediction.
Findings
Marginal improvements in zero-shot and few-shot metrics on Visual Genome
Effective synthesis of rare scene graph compositions
Analysis of limitations and future directions
Abstract
Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges in this task due to a long tail data distribution. Current scene graph generation models are trained on a tiny fraction of the distribution corresponding to the most frequent compositions, e.g. <cup, on, table>. However, test images might contain zero- and few-shot compositions of objects and relationships, e.g. <cup, on, surfboard>. Despite each of the object categories and the predicate (e.g. 'on') being frequent in the training data, the models often fail to properly understand such unseen or rare compositions. To improve generalization, it is natural to attempt increasing the diversity of the training distribution. However, in the graph domain this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis
MethodsLinear Layer · Attention Dropout · Adam · Dense Connections · Dropout · Linear Warmup With Linear Decay · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Multi-Head Attention
