Hyper-relationship Learning Network for Scene Graph Generation
Yibing Zhan, Zhi Chen, Jun Yu, BaoSheng Yu, Dacheng Tao, Yong Luo

TL;DR
This paper introduces HLN, a hyper-relationship learning network that enhances scene graph generation by integrating object and relationship interactions with transitive reasoning, significantly improving performance on the Visual Genome dataset.
Contribution
The paper proposes a novel HLN model utilizing hypergraphs and dual graph attention networks to perform transitive inference and better integrate graph component interactions in scene graph generation.
Findings
HLN improves recall per relationship from 11.3% to 13.1%.
HLN maintains recall per image from 19.8% to 34.9%.
HLN outperforms recent state-of-the-art methods on Visual Genome.
Abstract
Generating informative scene graphs from images requires integrating and reasoning from various graph components, i.e., objects and relationships. However, current scene graph generation (SGG) methods, including the unbiased SGG methods, still struggle to predict informative relationships due to the lack of 1) high-level inference such as transitive inference between relationships and 2) efficient mechanisms that can incorporate all interactions of graph components. To address the issues mentioned above, we devise a hyper-relationship learning network, termed HLN, for SGG. Specifically, the proposed HLN stems from hypergraphs and two graph attention networks (GATs) are designed to infer relationships: 1) the object-relationship GAT or OR-GAT to explore interactions between objects and relationships, and 2) the hyper-relationship GAT or HR-GAT to integrate transitive inference of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Advanced Graph Neural Networks
MethodsGraph Attention Network
