A Study of Cross-domain Generative Models applied to Cartoon Series
Eman T. Hassan, David J. Crandall

TL;DR
This paper explores the use of GANs to generate cartoon series images, revealing that models can learn semantic correspondences across different cartoon styles without explicit labels.
Contribution
It demonstrates that GANs can discover semantic-level relationships between distinct cartoon styles in an unsupervised manner.
Findings
GANs can generate cartoon images with semantic consistency.
Models find cross-domain semantic correspondences without labels.
Joint training on multiple cartoon styles is effective.
Abstract
We investigate Generative Adversarial Networks (GANs) to model one particular kind of image: frames from TV cartoons. Cartoons are particularly interesting because their visual appearance emphasizes the important semantic information about a scene while abstracting out the less important details, but each cartoon series has a distinctive artistic style that performs this abstraction in different ways. We consider a dataset consisting of images from two popular television cartoon series, Family Guy and The Simpsons. We examine the ability of GANs to generate images from each of these two domains, when trained independently as well as on both domains jointly. We find that generative models may be capable of finding semantic-level correspondences between these two image domains despite the unsupervised setting, even when the training data does not give labeled alignments between them.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Advanced Image Processing Techniques
