Deep Multiple Instance Feature Learning via Variational Autoencoder

Shabnam Ghaffarzadegan

arXiv:1807.02490·cs.LG·July 9, 2018

Deep Multiple Instance Feature Learning via Variational Autoencoder

Shabnam Ghaffarzadegan

PDF

Open Access

TL;DR

This paper introduces a deep learning framework combining discriminative models and variational autoencoders for multiple instance learning, improving representation quality and scalability for large datasets.

Contribution

It presents a novel weakly supervised deep learning approach that leverages VAEs to enhance MIL by learning meaningful latent representations and scaling to large datasets.

Findings

01

Outperforms state-of-the-art on benchmark datasets

02

Effective in audio event detection and segmentation

03

Latent representations discriminate positive and negative classes

Abstract

We describe a novel weakly supervised deep learning framework that combines both the discriminative and generative models to learn meaningful representation in the multiple instance learning (MIL) setting. MIL is a weakly supervised learning problem where labels are associated with groups of instances (referred as bags) instead of individual instances. To address the essential challenge in MIL problems raised from the uncertainty of positive instances label, we use a discriminative model regularized by variational autoencoders (VAEs) to maximize the differences between latent representations of all instances and negative instances. As a result, the hidden layer of the variational autoencoder learns meaningful representation. This representation can effectively be used for MIL problems as illustrated by better performance on the standard benchmark datasets comparing to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization

MethodsSolana Customer Service Number +1-833-534-1729