Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation"

Daniel Gallo Fern\'andez; R\u{a}zvan-Andrei Matisan; Alejandro Monroy; Mu\~noz; Janusz Partyka

arXiv:2407.19996·cs.CV·July 30, 2024

Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation"

Daniel Gallo Fern\'andez, R\u{a}zvan-Andrei Matisan, Alejandro Monroy, Mu\~noz, Janusz Partyka

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper reproduces and verifies the claims of ITI-GEN, a model designed to improve inclusiveness in text-to-image generation, and proposes enhancements to address its limitations.

Contribution

It confirms ITI-GEN's effectiveness and introduces Hard Prompt Search with negative prompting to mitigate its issues, especially with attribute disentanglement and scalability.

Findings

01

ITI-GEN improves diversity and quality of generated images

02

Hard Prompt Search with negative prompting enhances negation handling

03

Scaling to multiple attributes remains computationally challenging

Abstract

Text-to-image generative models often present issues regarding fairness with respect to certain sensitive attributes, such as gender or skin tone. This study aims to reproduce the results presented in "ITI-GEN: Inclusive Text-to-Image Generation" by Zhang et al. (2023a), which introduces a model to improve inclusiveness in these kinds of models. We show that most of the claims made by the authors about ITI-GEN hold: it improves the diversity and quality of generated images, it is scalable to different domains, it has plug-and-play capabilities, and it is efficient from a computational point of view. However, ITI-GEN sometimes uses undesired attributes as proxy features and it is unable to disentangle some pairs of (correlated) attributes such as gender and baldness. In addition, when the number of considered attributes increases, the training time grows exponentially and ITI-GEN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amonroym99/iti-gen-reproducibility
pytorchOfficial

Videos

Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation"· slideslive

Taxonomy

TopicsComputational and Text Analysis Methods