MixUp-MIL: Novel Data Augmentation for Multiple Instance Learning and a Study on Thyroid Cancer Diagnosis
Michael Gadermayr, Lukas Koller, Maximilian Tschuchnig, Lea Maria, Stangassinger, Christina Kreutzer, Sebastien Couillard-Despres, Gertie, Janneke Oostingh, Anton Hittmair

TL;DR
This paper introduces a novel intra-slide data augmentation method based on MixUp for multiple instance learning, significantly improving thyroid cancer diagnosis accuracy from whole slide images.
Contribution
The study proposes a new intra-slide interpolation technique for data augmentation in multiple instance learning, demonstrating its effectiveness over traditional MixUp methods.
Findings
Intra-slide interpolation improves diagnosis accuracy.
Traditional MixUp decreases model performance.
The method is validated on thyroid cancer datasets.
Abstract
Multiple instance learning exhibits a powerful approach for whole slide image-based diagnosis in the absence of pixel- or patch-level annotations. In spite of the huge size of hole slide images, the number of individual slides is often rather small, leading to a small number of labeled samples. To improve training, we propose and investigate different data augmentation strategies for multiple instance learning based on the idea of linear interpolations of feature vectors (known as MixUp). Based on state-of-the-art multiple instance learning architectures and two thyroid cancer data sets, an exhaustive study is conducted considering a range of common data augmentation strategies. Whereas a strategy based on to the original MixUp approach showed decreases in accuracy, the use of a novel intra-slide interpolation method led to consistent increases in accuracy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · Colorectal Cancer Screening and Detection
MethodsMixup
