Dataset Augmentation by Mixing Visual Concepts

Abdullah Al Rahat; Hemanth Venkateswara

arXiv:2412.15358·cs.CV·December 23, 2024

Dataset Augmentation by Mixing Visual Concepts

Abdullah Al Rahat, Hemanth Venkateswara

PDF

Open Access

TL;DR

This paper introduces a novel dataset augmentation method using fine-tuned diffusion models with a technique called Mixing Visual Concepts, which generates diverse, realistic images to improve classification performance.

Contribution

The paper presents a new fine-tuning approach with Mixing Visual Concepts for generating diverse images, enhancing dataset augmentation effectiveness.

Findings

01

Outperforms existing augmentation methods on benchmark classification tasks.

02

Produces diverse images with both coarse and fine-grained variations.

03

Effective in reducing domain discrepancy between real and generated data.

Abstract

This paper proposes a dataset augmentation method by fine-tuning pre-trained diffusion models. Generating images using a pre-trained diffusion model with textual conditioning often results in domain discrepancy between real data and generated images. We propose a fine-tuning approach where we adapt the diffusion model by conditioning it with real images and novel text embeddings. We introduce a unique procedure called Mixing Visual Concepts (MVC) where we create novel text embeddings from image captions. The MVC enables us to generate multiple images which are diverse and yet similar to the real data enabling us to perform effective dataset augmentation. We perform comprehensive qualitative and quantitative evaluations with the proposed dataset augmentation approach showcasing both coarse-grained and finegrained changes in generated images. Our approach outperforms state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques

MethodsDiffusion