cycle text2face: cycle text-to-face gan via transformers

Faezeh Gholamrezaie; Mohammad Manthouri

arXiv:2206.04503·cs.CV·June 10, 2022

cycle text2face: cycle text-to-face gan via transformers

Faezeh Gholamrezaie, Mohammad Manthouri

PDF

Open Access

TL;DR

Cycle Text2Face introduces a novel encoder-decoder GAN model utilizing transformers for detailed text-to-face generation, achieving superior quality and speed compared to previous models on the CelebA dataset.

Contribution

The paper presents a new encoder-decoder architecture combining sentence transformers and GANs for text-to-face synthesis, with cycle consistency for improved results.

Findings

01

Achieved an FID score of 3.458 on CelebA.

02

Outperformed previous GAN-based text-to-face models.

03

Provided high-speed face image generation.

Abstract

Text-to-face is a subset of text-to-image that require more complex architecture due to their more detailed production. In this paper, we present an encoder-decoder model called Cycle Text2Face. Cycle Text2Face is a new initiative in the encoder part, it uses a sentence transformer and GAN to generate the image described by the text. The Cycle is completed by reproducing the text of the face in the decoder part of the model. Evaluating the model using the CelebA dataset, leads to better results than previous GAN-based models. In measuring the quality of the generate face, in addition to satisfying the human audience, we obtain an FID score of 3.458. This model, with high-speed processing, provides quality face images in the short time.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis