Label Cleaning Multiple Instance Learning: Refining Coarse Annotations   on Single Whole-Slide Images

Zhenzhen Wang; Carla Saoud; Sintawat Wangsiricharoen; Aaron W. James,; Aleksander S. Popel; Jeremias Sulam

arXiv:2109.10778·cs.CV·June 9, 2022

Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images

Zhenzhen Wang, Carla Saoud, Sintawat Wangsiricharoen, Aaron W. James,, Aleksander S. Popel, Jeremias Sulam

PDF

Open Access 1 Repo

TL;DR

This paper introduces LC-MIL, a novel method for refining coarse pathology annotations on single whole-slide images, improving accuracy without external data, and reducing pathologists' workload.

Contribution

The paper presents LC-MIL, a lightweight multiple instance learning approach that refines coarse annotations on individual WSIs without requiring large training datasets.

Findings

01

LC-MIL significantly improves annotation accuracy.

02

Outperforms state-of-the-art methods on diverse cancer datasets.

03

Effective in refining real pathologist annotations.

Abstract

Annotating cancerous regions in whole-slide images (WSIs) of pathology samples plays a critical role in clinical diagnosis, biomedical research, and machine learning algorithms development. However, generating exhaustive and accurate annotations is labor-intensive, challenging, and costly. Drawing only coarse and approximate annotations is a much easier task, less costly, and it alleviates pathologists' workload. In this paper, we study the problem of refining these approximate annotations in digital pathology to obtain more accurate ones. Some previous works have explored obtaining machine learning models from these inaccurate annotations, but few of them tackle the refinement problem where the mislabeled regions should be explicitly identified and corrected, and all of them require a -- often very large -- number of training samples. We present a method, named Label Cleaning Multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sulam-group/mil-pathology
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Digital Imaging for Blood Diseases · Generative Adversarial Networks and Image Synthesis