Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
Zhenzhen Wang, Carla Saoud, Sintawat Wangsiricharoen, Aaron W. James,, Aleksander S. Popel, Jeremias Sulam

TL;DR
This paper introduces LC-MIL, a novel method for refining coarse pathology annotations on single whole-slide images, improving accuracy without external data, and reducing pathologists' workload.
Contribution
The paper presents LC-MIL, a lightweight multiple instance learning approach that refines coarse annotations on individual WSIs without requiring large training datasets.
Findings
LC-MIL significantly improves annotation accuracy.
Outperforms state-of-the-art methods on diverse cancer datasets.
Effective in refining real pathologist annotations.
Abstract
Annotating cancerous regions in whole-slide images (WSIs) of pathology samples plays a critical role in clinical diagnosis, biomedical research, and machine learning algorithms development. However, generating exhaustive and accurate annotations is labor-intensive, challenging, and costly. Drawing only coarse and approximate annotations is a much easier task, less costly, and it alleviates pathologists' workload. In this paper, we study the problem of refining these approximate annotations in digital pathology to obtain more accurate ones. Some previous works have explored obtaining machine learning models from these inaccurate annotations, but few of them tackle the refinement problem where the mislabeled regions should be explicitly identified and corrected, and all of them require a -- often very large -- number of training samples. We present a method, named Label Cleaning Multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Digital Imaging for Blood Diseases · Generative Adversarial Networks and Image Synthesis
