DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Weijia Wu, Yuzhong Zhao, Hao Chen, Yuchao Gu, Rui Zhao, Yefei He, Hong, Zhou, Mike Zheng Shou, Chunhua Shen

TL;DR
DatasetDM leverages diffusion models to generate diverse, annotated synthetic datasets for perception tasks, significantly reducing manual labeling effort and improving model performance and robustness across various computer vision applications.
Contribution
The paper introduces DatasetDM, a novel method that uses diffusion models and minimal manual labeling to generate large-scale, high-quality annotated datasets for perception tasks.
Findings
Achieves state-of-the-art results in semantic and instance segmentation.
Enhances domain generalization robustness.
Excels in zero-shot segmentation scenarios.
Abstract
Current deep networks are very data-hungry and benefit from training on largescale datasets, which are often time-consuming to collect and annotate. By contrast, synthetic data can be generated infinitely using generative models such as DALL-E and diffusion models, with minimal effort and cost. In this paper, we present DatasetDM, a generic dataset generation model that can produce diverse synthetic images and the corresponding high-quality perception annotations (e.g., segmentation masks, and depth). Our method builds upon the pre-trained diffusion model and extends text-guided image synthesis to perception data generation. We show that the rich latent code of the diffusion model can be effectively decoded as accurate perception annotations using a decoder module. Training the decoder only needs less than 1% (around 100 images) manually labeled images, enabling the generation of an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsDiffusion
