GANzilla: User-Driven Direction Discovery in Generative Adversarial   Networks

Noyan Evirgen; Xiang 'Anthony' Chen

arXiv:2207.08320·cs.HC·August 16, 2022

GANzilla: User-Driven Direction Discovery in Generative Adversarial Networks

Noyan Evirgen, Xiang 'Anthony' Chen

PDF

TL;DR

GANzilla is a user-driven tool that enables non-expert users to discover meaningful editing directions in GANs through an iterative scatter/gather approach, facilitating targeted image modifications.

Contribution

It introduces a novel interactive method allowing users to control GAN outputs without requiring algorithmic expertise, complementing existing algorithm-driven approaches.

Findings

01

Users successfully discovered editing directions for specific image modifications.

02

Participants achieved high-level goals like making faces appear happier.

03

The tool demonstrated diversity in user-generated editing directions.

Abstract

Generative Adversarial Network (GAN) is widely adopted in numerous application areas, such as data preprocessing, image editing, and creativity support. However, GAN's 'black box' nature prevents non-expert users from controlling what data a model generates, spawning a plethora of prior work that focused on algorithm-driven approaches to extract editing directions to control GAN. Complementarily, we propose a GANzilla: a user-driven tool that empowers a user with the classic scatter/gather technique to iteratively discover directions to meet their editing goals. In a study with 12 participants, GANzilla users were able to discover directions that (i) edited images to match provided examples (closed-ended tasks) and that (ii) met a high-level goal, e.g., making the face happier, while showing diversity across individuals (open-ended tasks).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.