MUSE: Textual Attributes Guided Portrait Painting Generation

Xiaodan Hu; Pengfei Yu; Kevin Knight; Heng Ji; Bo Li; Honghui Shi

arXiv:2011.04761 · cs.CV · September 21, 2021

TL;DR

MUSE is a neural network approach that generates portrait paintings from a subject's facial features and textual attributes, raising Inception Score by 6% and lowering FID by 11% relative to state-of-the-art methods that do not use textual attributes.

Contribution

The paper introduces a stacked neural network architecture that incorporates textual attributes into portrait generation, enabling more expressive and accurate representations.

Findings

01

Inception Score increased by 6%

02

FID score decreased by 11%

03

78% attribute accuracy in generated portraits
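
This page does not say how the 78% attribute accuracy is computed. The following is only a minimal sketch, assuming that each attribute is recovered from the generated portrait (for example by a classifier) and compared with the value supplied as input. The function name and the attribute names are hypothetical and are not taken from the paper.

```python
def attribute_accuracy(expected, predicted):
    """Fraction of input attributes whose recovered value matches.

    expected:  dict mapping attribute type -> value given as input
    predicted: dict mapping attribute type -> value recovered from the portrait
    Both dicts and their keys are illustrative assumptions.
    """
    if not expected:
        raise ValueError("expected must contain at least one attribute")
    # Count attribute types where the recovered value equals the input value.
    hits = sum(1 for key, value in expected.items() if predicted.get(key) == value)
    return hits / len(expected)


# Hypothetical attribute types and values, for illustration only.
expected = {"mood": "calm", "season": "winter", "age": "adult", "weather": "snow"}
predicted = {"mood": "calm", "season": "winter", "age": "adult", "weather": "rain"}
score = attribute_accuracy(expected, predicted)
```

Under this assumption, a score of 0.78 would mean that roughly four out of five input attributes can be read back from the generated portraits.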

Abstract

We propose a novel approach, MUSE, to illustrate textual attributes visually via portrait generation. MUSE takes as input a set of attributes written in text, together with facial features extracted from a photo of the subject. We propose 11 attribute types to represent inspirations from a subject's profile, emotion, story, and environment. We propose a novel stacked neural network architecture by extending an image-to-image generative model to accept textual attributes. Experiments show that our approach significantly outperforms several state-of-the-art methods that do not use textual attributes, with Inception Score increased by 6% and Fréchet Inception Distance (FID) decreased by 11%. We also propose a new attribute reconstruction metric to evaluate whether the generated portraits preserve the subject's attributes. Experiments show that our approach can…
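
For reference, FID compares the mean and covariance of feature vectors computed from real and generated images, and lower values are better. A minimal NumPy sketch of the standard formula follows. In practice the features come from an Inception network; here random arrays stand in for them, and the function name is an illustrative choice, not something from the MUSE repository.

```python
import numpy as np


def frechet_distance(feats_a, feats_b):
    """Frechet distance between Gaussians fitted to two feature sets.

    feats_a, feats_b: arrays of shape (num_samples, feature_dim).
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.cov(feats_a, rowvar=False)
    cov_b = np.cov(feats_b, rowvar=False)
    # Tr((cov_a cov_b)^(1/2)) equals the sum of square roots of the
    # eigenvalues of cov_a @ cov_b, which are real and non-negative for
    # covariance matrices; clip guards against tiny negative round-off.
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * trace_sqrt)


# Stand-in features (assumption: real use would extract Inception features).
rng = np.random.default_rng(0)
real_feats = rng.normal(size=(500, 4))
shifted_feats = real_feats + 2.0  # same covariance, mean shifted by 2 per dim

same = frechet_distance(real_feats, real_feats)
shifted = frechet_distance(real_feats, shifted_feats)
```

Identical feature sets give a distance of about zero, while shifting only the mean adds the squared length of the shift, so an 11% decrease in FID indicates generated portraits whose feature statistics sit closer to the reference set.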

Code & Models

Repositories

xiaodanhu/MUSE (PyTorch, official)

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications