# Unlabeled Insight, Labeled Boost: Contrastive Learning and Class-Adaptive Pseudo-Labeling for Semi-Supervised Medical Image Classification

**Authors:** Jing Yang, Mingliang Chen, Qinhao Jia, Shuxian Liu

PMC · DOI: 10.3390/e27101015 · Entropy · 2025-09-27

## TL;DR

This paper introduces a new semi-supervised framework for medical image classification that improves performance when labeled data is scarce and class imbalances exist.

## Contribution

The novel framework combines contrastive learning with class-adaptive pseudo-labeling to enhance performance in annotation-scarce and class-imbalanced medical imaging tasks.

## Key findings

- The proposed CLCP-MT framework achieves a 10.38% F1-score improvement on the ISIC2018 dataset with only 20% labeled data.
- It also shows a 2.64% AUC increase on the Chest X-ray14 dataset under similar conditions.
- The framework effectively reduces head-class dominance and improves tail-class recognition in medical image classification.

## Abstract

The medical imaging domain frequently encounters the dual challenges of annotation scarcity and class imbalance. A critical issue lies in effectively extracting information from limited labeled data while mitigating the dominance of head classes. The existing approaches often overlook in-depth modeling of sample relationships in low-dimensional spaces, while rigid or suboptimal dynamic thresholding strategies in pseudo-label generation are susceptible to noisy label interference, leading to cumulative bias amplification during the early training phases. To address these issues, we propose a semi-supervised medical image classification framework combining labeled data-contrastive learning with class-adaptive pseudo-labeling (CLCP-MT), comprising two key components: the semantic discrimination enhancement (SDE) module and the class-adaptive pseudo-label refinement (CAPR) module. The former incorporates supervised contrastive learning on limited labeled data to fully exploit discriminative information in latent structural spaces, thereby significantly amplifying the value of sparse annotations. The latter dynamically calibrates pseudo-label confidence thresholds according to real-time learning progress across different classes, effectively reducing head-class dominance while enhancing tail-class recognition performance. These synergistic modules collectively achieve breakthroughs in both information utilization efficiency and model robustness, demonstrating superior performance in class-imbalanced scenarios. Extensive experiments on the ISIC2018 skin lesion dataset and Chest X-ray14 thoracic disease dataset validate CLCP-MT’s efficacy. With only 20% labeled and 80% unlabeled data, our framework achieves a 10.38% F1-score improvement on ISIC2018 and a 2.64% AUC increase on Chest X-ray14 compared to the baselines, confirming its effectiveness and superiority under annotation-deficient and class-imbalanced conditions.

## Full-text entities

- **Diseases:** skin lesion (MESH:D012871), thoracic disease (MESH:D013896)
- **Chemicals:** CLCP-MT (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12563532/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12563532/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/PMC12563532/full.md

---
Source: https://tomesphere.com/paper/PMC12563532