DeFLOCNet: Deep Image Editing via Flexible Low-level Controls
Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao,, Bing Jiang, Wei Liu

TL;DR
DeFLOCNet is a deep learning model that improves image editing by directly integrating low-level user controls into the CNN's feature space, enabling more faithful and detailed content creation.
Contribution
It introduces a novel method of injecting low-level controls into each structure generation block within a deep encoder-decoder CNN for enhanced image editing fidelity.
Findings
Effective transformation of user controls into visual content
Improved fidelity in structure and texture generation
Outperforms existing methods on benchmark datasets
Abstract
User-intended visual content fills the hole regions of an input image in the image editing scenario. The coarse low-level inputs, which typically consist of sparse sketch lines and color dots, convey user intentions for content creation (\ie, free-form editing). While existing methods combine an input image and these low-level controls for CNN inputs, the corresponding feature representations are not sufficient to convey user intentions, leading to unfaithfully generated content. In this paper, we propose DeFLOCNet which relies on a deep encoder-decoder CNN to retain the guidance of these controls in the deep feature representations. In each skip-connection layer, we design a structure generation block. Instead of attaching low-level controls to an input image, we inject these controls directly into each structure generation block for sketch line refinement and color propagation in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging
MethodsSpatially-Adaptive Normalization
