GANILLA: Generative Adversarial Networks for Image to Illustration   Translation

Samet Hicsonmez; Nermin Samet; Emre Akbas; Pinar Duygulu

arXiv:2002.05638·cs.CV·February 17, 2020·1 cites

GANILLA: Generative Adversarial Networks for Image to Illustration Translation

Samet Hicsonmez, Nermin Samet, Emre Akbas, Pinar Duygulu

PDF

Open Access 4 Repos

TL;DR

This paper introduces GANILLA, a novel GAN-based model for unpaired image-to-illustration translation in children's books, achieving a better balance of style and content transfer, and proposes a new quantitative evaluation framework.

Contribution

GANILLA is a new generator architecture that improves style-content balance in unpaired image-to-illustration translation and includes a novel evaluation framework.

Findings

01

GANILLA outperforms state-of-the-art models on illustration datasets.

02

The proposed evaluation framework effectively measures style and content transfer.

03

The model achieves a better style-content balance in image translation.

Abstract

In this paper, we explore illustrations in children's books as a new domain in unpaired image-to-image translation. We show that although the current state-of-the-art image-to-image translation models successfully transfer either the style or the content, they fail to transfer both at the same time. We propose a new generator network to address this issue and show that the resulting network strikes a better balance between style and content. There are no well-defined or agreed-upon evaluation metrics for unpaired image-to-image translation. So far, the success of image translation models has been based on subjective, qualitative visual comparison on a limited number of images. To address this problem, we propose a new framework for the quantitative evaluation of image-to-illustration models, where both content and style are taken into account using separate classifiers. In this new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Handwritten Text Recognition Techniques · Video Analysis and Summarization