Min-Entropy Latent Model for Weakly Supervised Object Detection

Fang Wan; Pengxu Wei; Zhenjun Han; Jianbin Jiao; Qixiang Ye

arXiv:1902.06057 · cs.CV · February 19, 2019 · 1 citation

TL;DR

This paper introduces a min-entropy latent model (MELM) that reduces the randomness of object localization and the ambiguity of detectors in weakly supervised object detection, with reported improvements in detection, localization, and weakly supervised image classification.

Contribution

The paper proposes MELM, a novel framework that uses min-entropy to better learn object locations and reduce localization ambiguity in weakly supervised detection.

Findings

01. MELM outperforms state-of-the-art methods in object detection and localization.

02. The proposed model significantly improves weakly supervised image classification.

03. Recurrent learning with continuation optimization effectively handles non-convexity.

Abstract

Weakly supervised object detection is a challenging task when provided with image category supervision but required to learn, at the same time, object locations and object detectors. The inconsistency between the weak supervision and learning objectives introduces significant randomness to object locations and ambiguity to detectors. In this paper, a min-entropy latent model (MELM) is proposed for weakly supervised object detection. Min-entropy serves as a model to learn object locations and a metric to measure the randomness of object localization during learning. It aims to principally reduce the variance of learned instances and alleviate the ambiguity of detectors. MELM is decomposed into three components including proposal clique partition, object clique discovery, and object localization. MELM is optimized with a recurrent learning algorithm, which leverages continuation…
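The abstract describes min-entropy as a metric for the randomness of object localization. As a rough illustration only, the sketch below computes the Shannon entropy of a softmax distribution over per-proposal scores. This is an assumption made for illustration, not the objective defined in the paper, which this excerpt does not give. The function names and the example scores are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw proposal scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def localization_entropy(proposal_scores):
    """Hypothetical helper: entropy of the distribution over proposals.

    High entropy means the scores do not single out any proposal
    (ambiguous localization); low entropy means the probability mass
    concentrates on a few proposals.
    """
    return entropy(softmax(proposal_scores))


# Illustrative scores for four proposals in one image (made-up values).
ambiguous = [1.0, 1.0, 1.0, 1.0]   # every proposal looks equally likely
confident = [8.0, 0.5, 0.3, 0.1]   # one proposal clearly dominates

print(localization_entropy(ambiguous))
print(localization_entropy(confident))
```

Under this toy measure, the uniform scores give the maximum entropy for four proposals (ln 4), while the peaked scores give a much lower value. That is the sense in which driving entropy down would correspond to less random localization.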

Code & Models

Repositories

WinFrand/MELM · pytorch · Official

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques