OpenPath: Open-Set Active Learning for Pathology Image Classification via Pre-trained Vision-Language Models
Lanfeng Zhong, Xin Liao, Shichuan Zhang, Shaoting Zhang, and Guotai Wang

TL;DR
OpenPath introduces an open-set active learning framework for pathology image classification that effectively distinguishes between in-distribution and out-of-distribution samples using pre-trained vision-language models, reducing annotation costs.
Contribution
The paper presents a novel open-set active learning method leveraging pre-trained vision-language models with task-specific prompts and diverse sampling strategies, addressing OOD data in pathology image classification.
Findings
Significantly improves model performance with high purity sample selection.
Outperforms state-of-the-art open-set active learning methods.
Effectively reduces annotation costs in real-world clinical scenarios.
Abstract
Pathology image classification plays a crucial role in accurate medical diagnosis and treatment planning. Training high-performance models for this task typically requires large-scale annotated datasets, which are both expensive and time-consuming to acquire. Active Learning (AL) offers a solution by iteratively selecting the most informative samples for annotation, thereby reducing the labeling effort. However, most AL methods are designed under the assumption of a closed-set scenario, where all the unannotated images belong to target classes. In real-world clinical environments, the unlabeled pool often contains a substantial amount of Out-Of-Distribution (OOD) data, leading to low efficiency of annotation in traditional AL methods. Furthermore, most existing AL methods start with random selection in the first query round, leading to a significant waste of labeling costs in open-set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · AI in cancer detection · Domain Adaptation and Few-Shot Learning
