Training on Art Composition Attributes to Influence CycleGAN Art   Generation

Holly Grimm

arXiv:1812.07710·cs.LG·December 20, 2018

Training on Art Composition Attributes to Influence CycleGAN Art Generation

Holly Grimm

PDF

Open Access

TL;DR

This paper introduces a method to guide CycleGAN image translation by integrating an Art Composition Attributes Network trained on art evaluation rules, enhancing control over generated art images.

Contribution

The paper presents a novel approach of using an auxiliary neural network trained on art composition attributes to influence CycleGAN outputs.

Findings

01

ACAN effectively encodes art composition attributes.

02

Incorporating ACAN improves control over CycleGAN translation.

03

Method leverages art domain knowledge for better image synthesis.

Abstract

I consider how to influence CycleGAN, image-to-image translation, by using additional constraints from a neural network trained on art composition attributes. I show how I trained the the Art Composition Attributes Network (ACAN) by incorporating domain knowledge based on the rules of art evaluation and the result of applying each art composition attribute to apple2orange image translation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis · Digital Media and Visual Art

MethodsBatch Normalization · Residual Connection · PatchGAN · *Communicated@Fast*How Do I Communicate to Expedia? · Tanh Activation · Residual Block · Instance Normalization · Convolution · HuMan(Expedia)||How do I get a human at Expedia? · Sigmoid Activation

Full text

Training on Art Composition Attributes to Influence CycleGAN Art Generation

Holly Grimm

Independent Software Developer and Artist

[email protected]

Abstract

I consider how to influence CycleGAN, image-to-image translation, by using additional constraints from a neural network trained on art composition attributes. I show how I trained the the Art Composition Attributes Network (ACAN) by incorporating domain knowledge based on the rules of art evaluation and the result of applying each art composition attribute to apple2orange image translation.

1 Introduction

The standard adversarial and cyclical losses of a CycleGAN [1] were augmented with additional loss terms from a convolutional neural network trained with art composition attributes. During training of the CycleGAN, the user specifies values for each of the art composition attributes. For instance, if a target contrast value of 10 is specified, the generator should output images with more contrast than if the target contrast value is 1.

1.1 Art Composition Attributes

Eight art composition attributes were selected: variety of texture, variety of shape, variety of size, variety of color, contrast, repetition, primary color, and color harmony. Five hundred images from the WikiArt dataset [2] were labeled with these attributes. Figures 1, 2, 3 and 4 are examples of low and high values for variety of texture and contrast.

2 ACAN

Training consisted of fine-tuning a ResNet50 [3] pretrained on the ImageNet dataset. ResNet50 is a fifty-layer deep residual network with 16 residual blocks. Global Average Pooling (GAP) is applied to the ReLU output from each of the sixteen ResNet block activations, called rectified convolution maps [4]. The sixteen GAP outputs were concatenated and L2 normalization was applied to create a merge layer. From the merge layer, there are eight outputs, one for each of the attributes.

3 CycleGAN and ACAN

In addition to the standard CycleGAN losses (Adversarial, Cycle-Consistency, and Identity) the ACAN losses are a series of eight losses generated when the translated image is passed through the ACAN with eight target values. The difference between these target values and the values output by the network are the attribute losses.

4 Results

Below is a sampling of the results of running the CycleGAN training with ACAN on the apple2orange dataset. Even with a small training set size of 500 images, the ACAN is able to learn and generate apples with the eight art compositional attributes.

The most surprising result of this project is the painterly effect that the ACAN was able to inject into the CycleGAN generated images as seen in Figures 18 and 19.

Acknowledgments

This project was initially developed as part of the 2018 OpenAI Scholars program. I would like to thank my mentor, Christy Dennison from OpenAI, for her helpful comments along with support from Larissa Schiavo, Joshua Achiam, Jack Clark, and Greg Brockman from OpenAI.

References

[1] Zhu, J. & Park, T. & Isola, P. & Efros, A.A. (2017) Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. arXiv:1703.10593.

[2] Nichol, K. (2016) Kaggle dataset: Painter by numbers.

https://www.kaggle.com/c/painter-by-numbers.

[3] He, K. & Zhang, X. & Ren, S. & Sun, J. (2015) Deep Residual Learning for Image Recognition arXiv:1512.03385.

[4] Malu, G. & Bapi, R.S. & Indurkhya, B. (2017) Learning Photography Aesthetics with Deep CNNs. arXiv:1707.03981.