Diverse Semantic Image Editing with Style Codes
Hakan Sivuk, Aysegul Dundar

TL;DR
This paper introduces a novel semantic image editing framework that encodes styles of visible and partially visible objects separately, enabling diverse, high-quality edits with seamless boundaries and improved consistency.
Contribution
It proposes a new mechanism for style encoding of objects in semantic image editing, enhancing diversity, consistency, and boundary seamlessness over previous methods.
Findings
Significantly outperforms previous semantic editing methods
Produces more diverse and realistic edited images
Achieves better quantitative metrics in experiments
Abstract
Semantic image editing requires inpainting pixels following a semantic map. It is a challenging task since this inpainting requires both harmony with the context and strict compliance with the semantic maps. The majority of the previous methods proposed for this task try to encode the whole information from erased images. However, when an object is added to a scene such as a car, its style cannot be encoded from the context alone. On the other hand, the models that can output diverse generations struggle to output images that have seamless boundaries between the generated and unerased parts. Additionally, previous methods do not have a mechanism to encode the styles of visible and partially visible objects differently for better performance. In this work, we propose a framework that can encode visible and partially visible objects with a novel mechanism to achieve consistency in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Cell Image Analysis Techniques
MethodsInpainting
