Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation
Boris Knyazev, Harm de Vries, C\u{a}t\u{a}lina Cangea, Graham W., Taylor, Aaron Courville, Eugene Belilovsky

TL;DR
This paper identifies limitations in current scene graph generation methods related to graph density and relationship bias, proposing a density-normalized loss and a new evaluation metric to improve zero-shot and few-shot generalization.
Contribution
It introduces a simple, effective density-normalized edge loss and a novel weighted evaluation metric to enhance generalization in scene graph generation, especially for rare compositions.
Findings
Over two-fold improvement in generalization metrics
Standard loss is influenced by scene graph density
New weighted metric better evaluates zero/few-shot performance
Abstract
Scene graph generation (SGG) aims to predict graph-structured descriptions of input images, in the form of objects and relationships between them. This task is becoming increasingly useful for progress at the interface of vision and language. Here, it is important - yet challenging - to perform well on novel (zero-shot) or rare (few-shot) compositions of objects and relationships. In this paper, we identify two key issues that limit such generalization. Firstly, we show that the standard loss used in this task is unintentionally a function of scene graph density. This leads to the neglect of individual edges in large sparse graphs during training, even though these contain diverse few-shot examples that are important for generalization. Secondly, the frequency of relationships can create a strong bias in this task, such that a blind model predicting the most frequent relationship…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Topic Modeling
