IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion
Seungjun Moon, Gyeong-Moon Park

TL;DR
IntereStyle introduces a novel encoder training scheme that focuses on interest regions in images, improving GAN inversion by reducing distortion and enhancing perceptual quality, especially in complex real-world images.
Contribution
The paper proposes IntereStyle, a new encoder training method that disentangles interest and uninterest regions, addressing the distortion-perception trade-off in GAN inversion.
Findings
Achieves lower distortion and higher perceptual quality than existing encoders.
Robustly preserves original image features for better editing and style mixing.
Effectively filters uninterest region information to improve encoding focus.
Abstract
Recently, manipulation of real-world images has been highly elaborated along with the development of Generative Adversarial Networks (GANs) and corresponding encoders, which embed real-world images into the latent space. However, designing encoders of GAN still remains a challenging task due to the trade-off between distortion and perception. In this paper, we point out that the existing encoders try to lower the distortion not only on the interest region, e.g., human facial region but also on the uninterest region, e.g., background patterns and obstacles. However, most uninterest regions in real-world images are located at out-of-distribution (OOD), which are infeasible to be ideally reconstructed by generative models. Moreover, we empirically find that the uninterest region overlapped with the interest region can mangle the original feature of the interest region, e.g., a microphone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Advanced Image Processing Techniques
