DiffMix: Diffusion Model-based Data Synthesis for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets
Hyun-Jic Oh, Won-Ki Jeong

TL;DR
This paper introduces DiffMix, a diffusion model-based data synthesis approach that enhances nuclei segmentation and classification in imbalanced pathology datasets by generating realistic virtual samples to improve rare class recognition.
Contribution
The paper presents a novel diffusion model-based data augmentation method specifically designed for imbalanced pathology image datasets, improving rare nuclei classification and segmentation performance.
Findings
Improved classification accuracy for rare nuclei types.
Enhanced segmentation quality in imbalanced datasets.
Outperforms state-of-the-art methods on two datasets.
Abstract
Nuclei segmentation and classification is a significant process in pathology image analysis. Deep learning-based approaches have greatly contributed to the higher accuracy of this task. However, those approaches suffer from the imbalanced nuclei data composition, which shows lower classification performance on the rare nuclei class. In this paper, we propose a realistic data synthesis method using a diffusion model. We generate two types of virtual patches to enlarge the training data distribution, which is for balancing the nuclei class variance and for enlarging the chance to look at various nuclei. After that, we use a semantic-label-conditioned diffusion model to generate realistic and high-quality image samples. We demonstrate the efficacy of our method by experiment results on two imbalanced nuclei datasets, improving the state-of-the-art networks. The experimental results suggest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Medical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging
MethodsDiffusion
