GANILLA: Generative Adversarial Networks for Image to Illustration Translation
Samet Hicsonmez, Nermin Samet, Emre Akbas, Pinar Duygulu

TL;DR
This paper introduces GANILLA, a novel GAN-based model for unpaired image-to-illustration translation in children's books, achieving a better balance of style and content transfer, and proposes a new quantitative evaluation framework.
Contribution
GANILLA is a new generator architecture that improves style-content balance in unpaired image-to-illustration translation and includes a novel evaluation framework.
Findings
GANILLA outperforms state-of-the-art models on illustration datasets.
The proposed evaluation framework effectively measures style and content transfer.
The model achieves a better style-content balance in image translation.
Abstract
In this paper, we explore illustrations in children's books as a new domain in unpaired image-to-image translation. We show that although the current state-of-the-art image-to-image translation models successfully transfer either the style or the content, they fail to transfer both at the same time. We propose a new generator network to address this issue and show that the resulting network strikes a better balance between style and content. There are no well-defined or agreed-upon evaluation metrics for unpaired image-to-image translation. So far, the success of image translation models has been based on subjective, qualitative visual comparison on a limited number of images. To address this problem, we propose a new framework for the quantitative evaluation of image-to-illustration models, where both content and style are taken into account using separate classifiers. In this new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Handwritten Text Recognition Techniques · Video Analysis and Summarization
