Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning
Weifeng Ge, Sibei Yang, Yizhou Yu

TL;DR
This paper introduces a weakly supervised curriculum learning pipeline that improves multi-label classification, object detection, and semantic segmentation by utilizing intermediate localization and pixel labeling, achieving state-of-the-art results.
Contribution
It proposes a novel multi-stage pipeline with a filtering and fusion algorithm that enhances weakly supervised learning for complex vision tasks.
Findings
Achieves state-of-the-art results in multi-label classification.
Attains very competitive results in semantic segmentation.
Improves weakly supervised object detection accuracy.
Abstract
Supervised object detection and semantic segmentation require object or even pixel level annotations. When there exist image level labels only, it is challenging for weakly supervised algorithms to achieve accurate predictions. The accuracy achieved by top weakly supervised algorithms is still significantly lower than their fully supervised counterparts. In this paper, we propose a novel weakly supervised curriculum learning pipeline for multi-label object recognition, detection and semantic segmentation. In this pipeline, we first obtain intermediate object localization and pixel labeling results for the training images, and then use such results to train task-specific deep networks in a fully supervised manner. The entire process consists of four stages, including object localization in the training images, filtering and fusing object instances, pixel labeling for the training images,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
