Multi-Evidence Filtering and Fusion for Multi-Label Classification,   Object Detection and Semantic Segmentation Based on Weakly Supervised   Learning

Weifeng Ge; Sibei Yang; Yizhou Yu

arXiv:1802.09129·cs.CV·March 6, 2018·27 cites

Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning

Weifeng Ge, Sibei Yang, Yizhou Yu

PDF

Open Access

TL;DR

This paper introduces a weakly supervised curriculum learning pipeline that improves multi-label classification, object detection, and semantic segmentation by utilizing intermediate localization and pixel labeling, achieving state-of-the-art results.

Contribution

It proposes a novel multi-stage pipeline with a filtering and fusion algorithm that enhances weakly supervised learning for complex vision tasks.

Findings

01

Achieves state-of-the-art results in multi-label classification.

02

Attains very competitive results in semantic segmentation.

03

Improves weakly supervised object detection accuracy.

Abstract

Supervised object detection and semantic segmentation require object or even pixel level annotations. When there exist image level labels only, it is challenging for weakly supervised algorithms to achieve accurate predictions. The accuracy achieved by top weakly supervised algorithms is still significantly lower than their fully supervised counterparts. In this paper, we propose a novel weakly supervised curriculum learning pipeline for multi-label object recognition, detection and semantic segmentation. In this pipeline, we first obtain intermediate object localization and pixel labeling results for the training images, and then use such results to train task-specific deep networks in a fully supervised manner. The entire process consists of four stages, including object localization in the training images, filtering and fusing object instances, pixel labeling for the training images,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning