Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
Youming Deng, Yansheng Li, Yongjun Zhang, Xiang Xiang, Jian Wang,, Jingdong Chen, Jiayi Ma

TL;DR
This paper introduces a Hierarchical Memory Learning framework for Scene Graph Generation that mimics human hierarchical learning, improving predicate prediction by addressing data imbalance and granularity issues.
Contribution
It proposes a novel HML framework with Concept and Model Reconstruction constraints, enabling hierarchical learning from coarse to fine predicates in SGG.
Findings
Significant performance improvements on Visual Genome benchmark.
Effective alleviation of long-tail and mixed-granularity problems.
Framework applicable to various SGG models.
Abstract
As far as Scene Graph Generation (SGG), coarse and fine predicates mix in the dataset due to the crowd-sourced labeling, and the long-tail problem is also pronounced. Given this tricky situation, many existing SGG methods treat the predicates equally and learn the model under the supervision of mixed-granularity predicates in one stage, leading to relatively coarse predictions. In order to alleviate the negative impact of the suboptimum mixed-granularity annotation and long-tail effect problems, this paper proposes a novel Hierarchical Memory Learning (HML) framework to learn the model from simple to complex, which is similar to the human beings' hierarchical memory learning process. After the autonomous partition of coarse and fine predicates, the model is first trained on the coarse predicates and then learns the fine predicates. In order to realize this hierarchical learning pattern,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
