Importance Weighted Structure Learning for Scene Graph Generation
Daqi Liu, Miroslaw Bober, Josef Kittler

TL;DR
This paper introduces an importance weighted structure learning approach for scene graph generation, employing a tighter variational bound and a Gumbel-Softmax sampler to improve performance over traditional methods.
Contribution
It proposes a novel importance weighted variational inference method with a Gumbel-Softmax sampler for more accurate scene graph generation.
Findings
Achieves state-of-the-art results on scene graph benchmarks.
Outperforms traditional variational methods in accuracy.
Demonstrates the effectiveness of importance weighting in structured prediction.
Abstract
Scene graph generation is a structured prediction task aiming to explicitly model objects and their relationships via constructing a visually-grounded scene graph for an input image. Currently, the message passing neural network based mean field variational Bayesian methodology is the ubiquitous solution for such a task, in which the variational inference objective is often assumed to be the classical evidence lower bound. However, the variational approximation inferred from such loose objective generally underestimates the underlying posterior, which often leads to inferior generation performance. In this paper, we propose a novel importance weighted structure learning method aiming to approximate the underlying log-partition function with a tighter importance weighted lower bound, which is computed from multiple samples drawn from a reparameterizable Gumbel-Softmax sampler. A generic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Visual Attention and Saliency Detection · Image Retrieval and Classification Techniques
MethodsVariational Inference
