Global Meets Local: Effective Multi-Label Image Classification via Category-Aware Weak Supervision
Jiawei Zhan, Jun Liu, Wei Tang, Guannan Jiang, Xi Wang, Bin-Bin Gao,, Tianliang Zhang, Wenlong Wu, Wei Zhang, Chengjie Wang, Yuan Xie

TL;DR
This paper introduces a unified framework for multi-label image classification that effectively suppresses noisy proposals and enhances global-local feature interaction through category-aware weak supervision and a cross-granularity attention module, improving accuracy.
Contribution
It proposes a novel category-aware weak supervision method and a cross-granularity attention module to better utilize global and local features in multi-label classification.
Findings
Achieves superior performance on MS-COCO and VOC 2007 datasets.
Effectively suppresses noisy proposals and enhances feature interaction.
Outperforms state-of-the-art methods in multi-label image classification.
Abstract
Multi-label image classification, which can be categorized into label-dependency and region-based methods, is a challenging problem due to the complex underlying object layouts. Although region-based methods are less likely to encounter issues with model generalizability than label-dependency methods, they often generate hundreds of meaningless or noisy proposals with non-discriminative information, and the contextual dependency among the localized regions is often ignored or over-simplified. This paper builds a unified framework to perform effective noisy-proposal suppression and to interact between global and local features for robust feature learning. Specifically, we propose category-aware weak supervision to concentrate on non-existent categories so as to provide deterministic information for local feature learning, restricting the local branch to focus on more high-quality regions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
