Bi-level Feature Alignment for Versatile Image Translation and Manipulation
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui,, Aoran Xiao, Shijian Lu, Chunyan Miao

TL;DR
This paper introduces a bi-level feature alignment framework for versatile image translation and manipulation, improving style control and semantic guidance while reducing memory costs through a novel approximation of the top-k operation.
Contribution
It proposes a bi-level feature alignment strategy with a differentiable approximation of top-k, a semantic position encoding, and a confidence feature injection module for improved image translation.
Findings
Achieves superior qualitative and quantitative performance.
Reduces memory cost significantly compared to dense correspondence methods.
Effectively preserves texture structures and style control.
Abstract
Generative adversarial networks (GANs) have achieved great success in image translation and manipulation. However, high-fidelity image generation with faithful style control remains a grand challenge in computer vision. This paper presents a versatile image translation and manipulation framework that achieves accurate semantic and style guidance in image generation by explicitly building a correspondence. To handle the quadratic complexity incurred by building the dense correspondences, we introduce a bi-level feature alignment strategy that adopts a top- operation to rank block-wise features followed by dense attention between block features which reduces memory cost substantially. As the top- operation involves index swapping which precludes the gradient propagation, we approximate the non-differentiable top- operation with a regularized earth mover's problem so that its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Digital Media Forensic Detection
