From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation
Hanrong Shi, Lin Li, Jun Xiao, Yueting Zhuang, Long Chen

TL;DR
This paper introduces a shape-aware feature learning strategy for Panoptic Scene Graph Generation that incrementally trains classifiers with increasing complexity, improving robustness and performance over existing methods.
Contribution
It proposes a novel curricular learning approach incorporating shape-aware features into PSG, enhancing robustness and outperforming state-of-the-art methods.
Findings
Outperforms existing methods on PSG tasks.
Improves robustness in zero-shot scenarios.
Effective integration of shape-aware features.
Abstract
Panoptic Scene Graph Generation (PSG) aims to generate a comprehensive graph-structure representation based on panoptic segmentation masks. Despite remarkable progress in PSG, almost all existing methods neglect the importance of shape-aware features, which inherently focus on the contours and boundaries of objects. To bridge this gap, we propose a model-agnostic Curricular shApe-aware FEature (CAFE) learning strategy for PSG. Specifically, we incorporate shape-aware features (i.e., mask features and boundary features) into PSG, moving beyond reliance solely on bbox features. Furthermore, drawing inspiration from human cognition, we propose to integrate shape-aware features in an easy-to-hard manner. To achieve this, we categorize the predicates into three groups based on cognition learning difficulty and correspondingly divide the training process into three stages. Each stage utilizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications
MethodsFocus · Knowledge Distillation
