Exploring the Effectiveness of Mask-Guided Feature Modulation as a   Mechanism for Localized Style Editing of Real Images

Snehal Singh Tomar; Maitreya Suin; A.N. Rajagopalan

arXiv:2211.11224·cs.CV·December 13, 2022

Exploring the Effectiveness of Mask-Guided Feature Modulation as a Mechanism for Localized Style Editing of Real Images

Snehal Singh Tomar, Maitreya Suin, A.N. Rajagopalan

PDF

Open Access

TL;DR

This paper introduces SemanticStyle Autoencoder (SSAE), a novel method using mask-guided feature modulation in the latent space for efficient, localized, and photorealistic style editing of real images, avoiding expensive inversion and supervision.

Contribution

It proposes SSAE, a deep autoencoder leveraging semantic mask-guided latent manipulation for localized style editing, reducing computational cost and human supervision.

Findings

01

Effective localized style editing demonstrated

02

Outperforms existing methods in quality and efficiency

03

Provides both qualitative and quantitative validation

Abstract

The success of Deep Generative Models at high-resolution image generation has led to their extensive utilization for style editing of real images. Most existing methods work on the principle of inverting real images onto their latent space, followed by determining controllable directions. Both inversion of real images and determination of controllable latent directions are computationally expensive operations. Moreover, the determination of controllable latent directions requires additional human supervision. This work aims to explore the efficacy of mask-guided feature modulation in the latent space of a Deep Generative Model as a solution to these bottlenecks. To this end, we present the SemanticStyle Autoencoder (SSAE), a deep Generative Autoencoder model that leverages semantic mask-guided latent space manipulation for highly localized photorealistic style editing of real images. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques