Doubly Reparameterized Importance Weighted Structure Learning for Scene Graph Generation
Daqi Liu, Miroslaw Bober, Josef Kittler

TL;DR
This paper introduces a novel importance weighted structure learning approach for scene graph generation, employing a tighter variational bound and reparameterization techniques to improve accuracy and reduce variance, achieving state-of-the-art results.
Contribution
It proposes a doubly reparameterized importance weighted variational inference method with a Gumbel-Softmax sampler for scene graph generation, enhancing posterior approximation.
Findings
Achieves state-of-the-art performance on scene graph benchmarks.
Reduces variance of gradient estimates, improving learning stability.
Employs a tighter variational lower bound for better posterior modeling.
Abstract
As a structured prediction task, scene graph generation, given an input image, aims to explicitly model objects and their relationships by constructing a visually-grounded scene graph. In the current literature, such task is universally solved via a message passing neural network based mean field variational Bayesian methodology. The classical loose evidence lower bound is generally chosen as the variational inference objective, which could induce oversimplified variational approximation and thus underestimate the underlying complex posterior. In this paper, we propose a novel doubly reparameterized importance weighted structure learning method, which employs a tighter importance weighted lower bound as the variational inference objective. It is computed from multiple samples drawn from a reparameterizable Gumbel-Softmax sampler and the resulting constrained variational inference task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Recommender Systems and Techniques
MethodsVariational Inference
