Leveraging image captions for selective whole slide image annotation
Jingna Qiu, Marc Aubreville, Frauke Wilm, Mathias \"Ottl, Jonas Utz,, Maja Schlereth, Katharina Breininger

TL;DR
This paper introduces prototype sampling, a novel method for selecting informative image regions for annotation in whole slide images, leveraging image captions to improve deep learning model training efficiency.
Contribution
It proposes a new prototype sampling technique that uses image-caption data to identify task-relevant regions, outperforming existing diversity sampling methods.
Findings
Prototype sampling outperforms random and diversity sampling.
Improved model performance in tissue segmentation and mitotic detection.
Effective identification of valuable training regions.
Abstract
Acquiring annotations for whole slide images (WSIs)-based deep learning tasks, such as creating tissue segmentation masks or detecting mitotic figures, is a laborious process due to the extensive image size and the significant manual work involved in the annotation. This paper focuses on identifying and annotating specific image regions that optimize model training, given a limited annotation budget. While random sampling helps capture data variance by collecting annotation regions throughout the WSIs, insufficient data curation may result in an inadequate representation of minority classes. Recent studies proposed diversity sampling to select a set of regions that maximally represent unique characteristics of the WSIs. This is done by pretraining on unlabeled data through self-supervised learning and then clustering all regions in the latent space. However, establishing the optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization
MethodsSparse Evolutionary Training
