R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation
Huy Che, Dinh-Duy Phan, Duc-Khai Lam

TL;DR
This paper introduces a novel synthetic data augmentation pipeline using controllable diffusion models to improve semantic segmentation by balancing diversity and reliability, especially in data-scarce scenarios.
Contribution
The work presents a new augmentation method that integrates class-aware prompting and visual prior blending with diffusion models for pixel-level tasks.
Findings
Significantly improves segmentation performance on PASCAL VOC and BDD100K datasets.
Enhances model robustness in real-world applications.
Effective in data-scarce scenarios.
Abstract
Collecting and annotating datasets for pixel-level semantic segmentation tasks are highly labor-intensive. Data augmentation provides a viable solution by enhancing model generalization without additional real-world data collection. Traditional augmentation techniques, such as translation, scaling, and color transformations, create geometric variations but fail to generate new structures. While generative models have been employed to extend semantic information of datasets, they often struggle to maintain consistency between the original and generated images, particularly for pixel-level tasks. In this work, we propose a novel synthetic data augmentation pipeline that integrates controllable diffusion models. Our approach balances diversity and reliability data, effectively bridging the gap between synthetic and real data. We utilize class-aware prompting and visual prior blending to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
