CIEGAD: Cluster-Conditioned Interpolative and Extrapolative Framework for Geometry-Aware and Domain-Aligned Data Augmentation
Keito Inoshita, Xiaokang Zhou, Akira Kawai, Katsutoshi Yada

TL;DR
CIEGAD is a novel data augmentation framework that enhances deep learning models by systematically generating semantically diverse, domain-aligned, and high-quality data in both in-distribution and out-of-distribution regions, especially benefiting long-tailed and multi-class tasks.
Contribution
It introduces a cluster-conditioned, geometry-aware augmentation method combining interpolative and extrapolative synthesis with quality control, filling a gap in integrated, controllable data augmentation frameworks.
Findings
Improves F1 and recall in long-tailed classification tasks.
Effectively extends data distribution peripheries while maintaining alignment and diversity.
Demonstrates robustness across multiple classification benchmarks.
Abstract
In practical deep learning deployment, the scarcity of data and the imbalance of label distributions often lead to semantically uncovered regions within the real-world data distribution, hindering model training and causing misclassification near class boundaries as well as unstable behaviors in peripheral areas. Although recent large language models (LLMs) show promise for data augmentation, an integrated framework that simultaneously achieves directional control of generation, domain alignment, and quality control has not yet been fully established. To address these challenges, we propose a Cluster-conditioned Interpolative and Extrapolative framework for Geometry-Aware and Domain-aligned data augmentation (CIEGAD), which systematically complements both in-distribution and out-of-distribution semantically uncovered regions. CIEGAD constructs domain profiles through cluster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks · Topic Modeling
