CaT: Weakly Supervised Object Detection with Category Transfer
Tianyue Cao, Lianyu Du, Xiaoyun Zhang, Siheng Chen, Ya Zhang, Yan-Feng, Wang

TL;DR
This paper introduces a category transfer framework that leverages fully-supervised datasets to improve weakly-supervised object detection by exploiting discriminative and semantic category information, significantly boosting detection accuracy.
Contribution
The proposed framework effectively utilizes both visually-discriminative and semantically-correlated category information through double-supervision and semantic graph convolutional networks, addressing domain gaps and overlapping categories.
Findings
Achieves 63.5% mAP on Pascal VOC 2007 with 5 overlapping categories.
Outperforms state-of-the-art methods in weakly-supervised object detection.
Demonstrates effective transfer of category knowledge from COCO to Pascal VOC.
Abstract
A large gap exists between fully-supervised object detection and weakly-supervised object detection. To narrow this gap, some methods consider knowledge transfer from additional fully-supervised dataset. But these methods do not fully exploit discriminative category information in the fully-supervised dataset, thus causing low mAP. To solve this issue, we propose a novel category transfer framework for weakly supervised object detection. The intuition is to fully leverage both visually-discriminative and semantically-correlated category information in the fully-supervised dataset to enhance the object-classification ability of a weakly-supervised detector. To handle overlapping category transfer, we propose a double-supervision mean teacher to gather common category information and bridge the domain gap between two datasets. To handle non-overlapping category transfer, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
