AtteConDA: Attention-Based Conflict Suppression in Multi-Condition Diffusion Models and Synthetic Data Augmentation
Shogo Noguchi

TL;DR
This paper introduces AtteConDA, a multi-condition diffusion model with conflict suppression to generate structurally faithful images for autonomous driving, enhancing data augmentation and high-level scene understanding.
Contribution
It proposes a novel conflict modeling approach for multi-condition image generation, improving structural preservation in synthetic data for autonomous driving tasks.
Findings
The conflict-aware model enables better structural fidelity in generated images.
The framework improves data augmentation for high-level driving tasks.
Evaluation shows enhanced scene consistency compared to baseline methods.
Abstract
Recent conditional image generation methods can improve controllability by generating images that are faithful to conditions such as sketches, human poses, segmentation maps, and depth. By applying these techniques to image augmentation while preserving annotations, generated images can be used as additional training data and can improve recognition performance. However, for high-level driving tasks such as traffic-rule extraction and driving-behavior understanding, simply using annotations as conditions is insufficient. Instead, images must be augmented while preserving the detailed high-level structure of the original scene. One possible solution is to use multiple conditions so that generated images retain diverse structural cues after generation. However, when multiple conditions are used, conflicts among conditions can prevent reliable structure preservation. In this work, we input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
