Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning
Rafael Elberg, Denis Parra, Mircea Petrache

TL;DR
This paper introduces a novel data augmentation method using feature space manipulation and iterated learning with latent diffusion models to improve image generation for long-tailed, poorly distributed datasets, especially in medical imaging.
Contribution
It presents a new approach combining feature space augmentation and iterated learning to enhance image generation for under-represented classes in long-tailed datasets.
Findings
Improved generation quality for tail classes
Faster inference compared to traditional diffusion models
Effective augmentation demonstrated on medical imaging datasets
Abstract
Image and multimodal machine learning tasks are very challenging to solve in the case of poorly distributed data. In particular, data availability and privacy restrictions exacerbate these hurdles in the medical domain. The state of the art in image generation quality is held by Latent Diffusion models, making them prime candidates for tackling this problem. However, a few key issues still need to be solved, such as the difficulty in generating data from under-represented classes and a slow inference process. To mitigate these issues, we propose a new method for image augmentation in long-tailed data based on leveraging the rich latent space of pre-trained Stable Diffusion Models. We create a modified separable latent space to mix head and tail class examples. We build this space via Iterated Learning of underlying sparsified embeddings, which we apply to task-specific saliency maps via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Medical Image Segmentation Techniques · Image Retrieval and Classification Techniques
Methodsk-Nearest Neighbors · Diffusion
