Min-Entropy Latent Model for Weakly Supervised Object Detection
Fang Wan, Pengxu Wei, Zhenjun Han, Jianbin Jiao, Qixiang Ye

TL;DR
This paper introduces a min-entropy latent model (MELM) that effectively reduces randomness and ambiguity in weakly supervised object detection, leading to significant performance improvements across multiple tasks.
Contribution
The paper proposes MELM, a novel framework that uses min-entropy to better learn object locations and reduce localization ambiguity in weakly supervised detection.
Findings
MELM outperforms state-of-the-art methods in object detection and localization.
The proposed model significantly improves weakly supervised image classification.
Recurrent learning with continuation optimization effectively handles non-convexity.
Abstract
Weakly supervised object detection is a challenging task when provided with image category supervision but required to learn, at the same time, object locations and object detectors. The inconsistency between the weak supervision and learning objectives introduces significant randomness to object locations and ambiguity to detectors. In this paper, a min-entropy latent model (MELM) is proposed for weakly supervised object detection. Min-entropy serves as a model to learn object locations and a metric to measure the randomness of object localization during learning. It aims to principally reduce the variance of learned instances and alleviate the ambiguity of detectors. MELM is decomposed into three components including proposal clique partition, object clique discovery, and object localization. MELM is optimized with a recurrent learning algorithm, which leverages continuation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
