Information Gain Sampling for Active Learning in Medical Image   Classification

Raghav Mehta; Changjian Shui; Brennan Nichyporuk; Tal Arbel

arXiv:2208.00974·cs.CV·August 2, 2022

Information Gain Sampling for Active Learning in Medical Image Classification

Raghav Mehta, Changjian Shui, Brennan Nichyporuk, Tal Arbel

PDF

Open Access

TL;DR

This paper introduces an information-theoretic active learning method, AEIG, that efficiently selects medical images for labeling, significantly reducing annotation efforts while maintaining high classification performance.

Contribution

The paper proposes AEIG, an adapted expected information gain approach that accounts for class imbalance, improving active learning efficiency in medical image classification.

Findings

01

AEIG outperforms popular baselines like CoreSet and maximum entropy sampling.

02

AEIG achieves ~95% of full data performance with only 19% of training data.

03

The method can be integrated into existing deep learning classifiers.

Abstract

Large, annotated datasets are not widely available in medical image analysis due to the prohibitive time, costs, and challenges associated with labelling large datasets. Unlabelled datasets are easier to obtain, and in many contexts, it would be feasible for an expert to provide labels for a small subset of images. This work presents an information-theoretic active learning framework that guides the optimal selection of images from the unlabelled pool to be labeled based on maximizing the expected information gain (EIG) on an evaluation dataset. Experiments are performed on two different medical image classification datasets: multi-class diabetic retinopathy disease scale classification and multi-class skin lesion classification. Results indicate that by adapting EIG to account for class-imbalances, our proposed Adapted Expected Information Gain (AEIG) outperforms several popular…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Systemic Lupus Erythematosus Research · Domain Adaptation and Few-Shot Learning