Attention-based Deep Multiple Instance Learning
Maximilian Ilse, Jakub M. Tomczak, Max Welling

TL;DR
This paper introduces an attention-based neural network approach for multiple instance learning that models bag labels probabilistically, offers interpretability, and achieves competitive results on various datasets.
Contribution
It proposes a novel neural network permutation-invariant attention mechanism for MIL, enhancing interpretability and performance.
Findings
Achieves comparable performance to top MIL methods on benchmarks.
Outperforms existing methods on MNIST-based and histopathology datasets.
Provides interpretability by highlighting instance contributions.
Abstract
Multiple instance learning (MIL) is a variation of supervised learning where a single class label is assigned to a bag of instances. In this paper, we state the MIL problem as learning the Bernoulli distribution of the bag label where the bag label probability is fully parameterized by neural networks. Furthermore, we propose a neural network-based permutation-invariant aggregation operator that corresponds to the attention mechanism. Notably, an application of the proposed attention-based operator provides insight into the contribution of each instance to the bag label. We show empirically that our approach achieves comparable performance to the best MIL methods on benchmark MIL datasets and it outperforms other methods on a MNIST-based MIL dataset and two real-life histopathology datasets without sacrificing interpretability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · AI in cancer detection
