R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation

Huy Che; Dinh-Duy Phan; Duc-Khai Lam

arXiv:2603.18427·cs.CV·March 20, 2026·ICCCI

R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation

Huy Che, Dinh-Duy Phan, Duc-Khai Lam

PDF

Open Access

TL;DR

This paper introduces a novel synthetic data augmentation pipeline using controllable diffusion models to improve semantic segmentation by balancing diversity and reliability, especially in data-scarce scenarios.

Contribution

The work presents a new augmentation method that integrates class-aware prompting and visual prior blending with diffusion models for pixel-level tasks.

Findings

01

Significantly improves segmentation performance on PASCAL VOC and BDD100K datasets.

02

Enhances model robustness in real-world applications.

03

Effective in data-scarce scenarios.

Abstract

Collecting and annotating datasets for pixel-level semantic segmentation tasks are highly labor-intensive. Data augmentation provides a viable solution by enhancing model generalization without additional real-world data collection. Traditional augmentation techniques, such as translation, scaling, and color transformations, create geometric variations but fail to generate new structures. While generative models have been employed to extend semantic information of datasets, they often struggle to maintain consistency between the original and generated images, particularly for pixel-level tasks. In this work, we propose a novel synthetic data augmentation pipeline that integrates controllable diffusion models. Our approach balances diversity and reliability data, effectively bridging the gap between synthetic and real data. We utilize class-aware prompting and visual prior blending to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning