Large-Scale Universal Defect Generation: Foundation Models and Datasets

Yuanting Fan; Jun Liu; Bin-Bin Gao; Xiaochen Chen; Yuhuan Lin; Zhewei Dai; Jiawei Zhan; Chengjie Wang

arXiv:2604.08915·cs.CV·April 13, 2026

TL;DR

This paper introduces UDG, a large-scale defect dataset, and UniDG, a universal foundation model for defect generation that improves diversity, realism, and generalization without per-category fine-tuning.

Contribution

The work presents UDG, a dataset of 300K normal-abnormal-mask-caption quadruplets, and UniDG, a foundation model that supports both reference-based defect generation and text instruction-based defect editing across diverse categories.

Findings

01. UniDG outperforms prior methods in synthesis quality.
02. Extensive experiments show improved anomaly detection and localization.
03. The dataset enables better generalization across defect categories.

Abstract

Existing defect/anomaly generation methods often rely on few-shot learning, which overfits to specific defect categories because large-scale paired defect editing data is lacking. Substantial variations in defect scale and morphology aggravate the problem, resulting in limited generalization, degraded realism, and weakened category consistency. We address these challenges by introducing UDG, a large-scale dataset of 300K normal-abnormal-mask-caption quadruplets spanning diverse domains, and by presenting UniDG, a universal defect generation foundation model that supports both reference-based defect generation and text instruction-based defect editing without per-category fine-tuning. UniDG performs Defect-Context Editing via adaptive defect cropping and a structured diptych input format, and fuses reference and target conditions through MM-DiT multimodal attention. A two-stage training…
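To make the quadruplet and diptych ideas concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not state the cropping rule or the panel layout, so the `DefectSample` fields, the `context` and `min_size` parameters, and the left/right panel order are all hypothetical. Nested lists stand in for real image arrays.

```python
from dataclasses import dataclass
from typing import List, Tuple

Image = List[List[int]]   # rows of pixel values; stand-in for a real image array
Mask = List[List[bool]]   # True marks a defect pixel
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), bottom/right exclusive


@dataclass
class DefectSample:
    """One normal-abnormal-mask-caption quadruplet (field names are hypothetical)."""
    normal: Image
    abnormal: Image
    mask: Mask
    caption: str


def adaptive_crop_box(mask: Mask, context: float = 0.5, min_size: int = 32) -> Box:
    """Box around the defect, padded in proportion to the defect's own size.

    Assumed rule: pad each axis by `context` times the defect extent, or by
    enough to reach `min_size`, whichever is larger, then clamp to the image.
    """
    rows = [y for y, row in enumerate(mask) if any(row)]
    if not rows:
        raise ValueError("mask contains no defect pixels")
    cols = [x for x in range(len(mask[0])) if any(row[x] for row in mask)]
    top, bottom = rows[0], rows[-1] + 1
    left, right = cols[0], cols[-1] + 1
    h, w = bottom - top, right - left
    pad_y = max(int(context * h), (min_size - h + 1) // 2)
    pad_x = max(int(context * w), (min_size - w + 1) // 2)
    return (
        max(0, top - pad_y),
        max(0, left - pad_x),
        min(len(mask), bottom + pad_y),
        min(len(mask[0]), right + pad_x),
    )


def crop(image: Image, box: Box) -> Image:
    """Cut the box out of an image."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


def make_diptych(left_panel: Image, right_panel: Image) -> Image:
    """Place two equal-height panels side by side as one image."""
    if len(left_panel) != len(right_panel):
        raise ValueError("panels must have the same height")
    return [a + b for a, b in zip(left_panel, right_panel)]
```

A small defect gets a crop that is mostly context, while a large defect gets a crop sized to the defect itself, which is one plausible reading of "adaptive" given the scale variation the abstract mentions. Cropping the same box from a reference image and from a target image, then passing both crops to `make_diptych`, yields a single two-panel input.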

Code & Models

Repositories

RetoFan233/UniDG (GitHub)
