Multiple instance learning on deep features for weakly supervised object   detection with extreme domain shifts

Nicolas Gonthier; Sa\"id Ladjal; Yann Gousseau

arXiv:2008.01178·cs.CV·November 15, 2021

Multiple instance learning on deep features for weakly supervised object detection with extreme domain shifts

Nicolas Gonthier, Sa\"id Ladjal, Yann Gousseau

PDF

2 Repos 1 Datasets

TL;DR

This paper demonstrates that a straightforward multiple instance learning approach on pre-trained deep features can effectively perform weakly supervised object detection across diverse and challenging non-photographic datasets without fine-tuning.

Contribution

It introduces a simple, fine-tuning-free multiple instance learning method that generalizes well to various non-photographic domains and unseen classes.

Findings

01

Achieves competitive results on paintings, watercolors, cliparts, and comics datasets.

02

No fine-tuning or cross-domain learning required, enabling quick adaptation.

03

Effective for new visual categories without extensive training.

Abstract

Weakly supervised object detection (WSOD) using only image-level annotations has attracted a growing attention over the past few years. Whereas such task is typically addressed with a domain-specific solution focused on natural images, we show that a simple multiple instance approach applied on pre-trained deep features yields excellent performances on non-photographic datasets, possibly including new classes. The approach does not include any fine-tuning or cross-domain learning and is therefore efficient and possibly applicable to arbitrary datasets and classes. We investigate several flavors of the proposed approach, some including multi-layers perceptron and polyhedral classifiers. Despite its simplicity, our method shows competitive results on a range of publicly available datasets, including paintings (People-Art, IconArt), watercolors, cliparts and comics and allows to quickly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

NGonthier/IconArt
dataset· 17 dl
17 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.