Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation"
Daniel Gallo Fern\'andez, R\u{a}zvan-Andrei Matisan, Alejandro Monroy, Mu\~noz, Janusz Partyka

TL;DR
This paper reproduces and verifies the claims of ITI-GEN, a model designed to improve inclusiveness in text-to-image generation, and proposes enhancements to address its limitations.
Contribution
It confirms ITI-GEN's effectiveness and introduces Hard Prompt Search with negative prompting to mitigate its issues, especially with attribute disentanglement and scalability.
Findings
ITI-GEN improves diversity and quality of generated images
Hard Prompt Search with negative prompting enhances negation handling
Scaling to multiple attributes remains computationally challenging
Abstract
Text-to-image generative models often present issues regarding fairness with respect to certain sensitive attributes, such as gender or skin tone. This study aims to reproduce the results presented in "ITI-GEN: Inclusive Text-to-Image Generation" by Zhang et al. (2023a), which introduces a model to improve inclusiveness in these kinds of models. We show that most of the claims made by the authors about ITI-GEN hold: it improves the diversity and quality of generated images, it is scalable to different domains, it has plug-and-play capabilities, and it is efficient from a computational point of view. However, ITI-GEN sometimes uses undesired attributes as proxy features and it is unable to disentangle some pairs of (correlated) attributes such as gender and baldness. In addition, when the number of considered attributes increases, the training time grows exponentially and ITI-GEN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComputational and Text Analysis Methods
