Multi-label Classification with Panoptic Context Aggregation Networks
Mingyuan Jiu, Hailong Zhu, Wenchuan Wei, Hichem Sahbi, Rongrong Ji, Mingliang Xu

TL;DR
The paper introduces PanCAN, a novel deep network that hierarchically integrates multi-scale, multi-order geometric contexts for improved multi-label image classification, outperforming existing methods on standard benchmarks.
Contribution
It proposes a new hierarchical framework that combines cross-scale feature aggregation with attention mechanisms to model complex scene contexts in multi-label classification.
Findings
PanCAN outperforms state-of-the-art methods on NUS-WIDE, PASCAL VOC2007, and MS-COCO.
The approach effectively models multi-scale, multi-order geometric relationships.
Experimental results show significant improvements in classification accuracy.
Abstract
Context modeling is crucial for visual recognition, enabling highly discriminative image representations by integrating both intrinsic and extrinsic relationships between objects and labels in images. A limitation in current approaches is their focus on basic geometric relationships or localized features, often neglecting cross-scale contextual interactions between objects. This paper introduces the Deep Panoptic Context Aggregation Network (PanCAN), a novel approach that hierarchically integrates multi-order geometric contexts through cross-scale feature aggregation in a high-dimensional Hilbert space. Specifically, PanCAN learns multi-order neighborhood relationships at each scale by combining random walks with an attention mechanism. Modules from different scales are cascaded, where salient anchors at a finer scale are selected and their neighborhood features are dynamically fused…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
