Deep Multiple Instance Feature Learning via Variational Autoencoder
Shabnam Ghaffarzadegan

TL;DR
This paper introduces a deep learning framework combining discriminative models and variational autoencoders for multiple instance learning, improving representation quality and scalability for large datasets.
Contribution
It presents a novel weakly supervised deep learning approach that leverages VAEs to enhance MIL by learning meaningful latent representations and scaling to large datasets.
Findings
Outperforms state-of-the-art on benchmark datasets
Effective in audio event detection and segmentation
Latent representations discriminate positive and negative classes
Abstract
We describe a novel weakly supervised deep learning framework that combines both the discriminative and generative models to learn meaningful representation in the multiple instance learning (MIL) setting. MIL is a weakly supervised learning problem where labels are associated with groups of instances (referred as bags) instead of individual instances. To address the essential challenge in MIL problems raised from the uncertainty of positive instances label, we use a discriminative model regularized by variational autoencoders (VAEs) to maximize the differences between latent representations of all instances and negative instances. As a result, the hidden layer of the variational autoencoder learns meaningful representation. This representation can effectively be used for MIL problems as illustrated by better performance on the standard benchmark datasets comparing to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization
MethodsSolana Customer Service Number +1-833-534-1729
