USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images
Jiamu Wang, Jin Tae Kwak

TL;DR
USegMix is an unsupervised data augmentation technique for pathology images that creates realistic synthetic images by segment replacement and blending, improving cancer classification accuracy.
Contribution
It introduces a novel two-phase unsupervised method combining superpixels, SAM, and diffusion models for realistic pathology image augmentation.
Findings
Enhanced cancer classification performance.
Effective generation of diverse, realistic pathology images.
Applicable to colorectal and prostate cancer datasets.
Abstract
In computational pathology, researchers often face challenges due to the scarcity of labeled pathology datasets. Data augmentation emerges as a crucial technique to mitigate this limitation. In this study, we introduce an efficient data augmentation method for pathology images, called USegMix. Given a set of pathology images, the proposed method generates a new, synthetic image in two phases. In the first phase, USegMix constructs a pool of tissue segments in an automated and unsupervised manner using superpixels and the Segment Anything Model (SAM). In the second phase, USegMix selects a candidate segment in a target image, replaces it with a similar segment from the segment pool, and blends them by using a pre-trained diffusion model. In this way, USegMix can generate diverse and realistic pathology images. We rigorously evaluate the effectiveness of USegMix on two pathology image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Medical Image Segmentation Techniques · Image Retrieval and Classification Techniques
MethodsDiffusion · Sparse Evolutionary Training
