Optimized latent-code selection for explainable conditional   text-to-image GANs

Zhenxing Zhang; Lambert Schomaker

arXiv:2204.12678·cs.CV·April 28, 2022

Optimized latent-code selection for explainable conditional text-to-image GANs

Zhenxing Zhang, Lambert Schomaker

PDF

Open Access

TL;DR

This paper explores techniques to analyze and interpret the latent and semantic spaces of conditional text-to-image GANs, enhancing explainability and understanding of model behavior.

Contribution

It introduces interpolation methods and a Good/Bad dataset with a framework to identify effective latent codes, improving interpretability of GANs.

Findings

01

Over 94% accuracy in classifying latent codes as Good or Bad

02

Effective visualization of learned representations through interpolation

03

Public availability of the Good/Bad dataset for further research

Abstract

The task of text-to-image generation has achieved remarkable progress due to the advances in the conditional generative adversarial networks (GANs). However, existing conditional text-to-image GANs approaches mostly concentrate on improving both image quality and semantic relevance but ignore the explainability of the model which plays a vital role in real-world applications. In this paper, we present a variety of techniques to take a deep look into the latent space and semantic space of the conditional text-to-image GANs model. We introduce pairwise linear interpolation of latent codes and `linguistic' linear interpolation to study what the model has learned within the latent space and `linguistic' embeddings. Subsequently, we extend linear interpolation to triangular interpolation conditioned on three corners to further analyze the model. After that, we build a Good/Bad data set…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computational and Text Analysis Methods

MethodsSupport Vector Machine