Content-Aware Preserving Image Generation
Giang H. Le, Anh Q. Nguyen, Byeongkeun Kang, Yeejin Lee

TL;DR
This paper introduces a novel image generation framework that explicitly incorporates desired content into generated images by using advanced encoding, frequency analysis, and content fusion modules, enabling precise content control.
Contribution
The proposed framework uniquely combines frequency encoding and content fusion modules to improve content preservation in image generation tasks.
Findings
Effective content preservation demonstrated on benchmark datasets
Produces diverse stylistic variations while maintaining content
Outperforms existing methods in content control accuracy
Abstract
Remarkable progress has been achieved in image generation with the introduction of generative models. However, precisely controlling the content in generated images remains a challenging task due to their fundamental training objective. This paper addresses this challenge by proposing a novel image generation framework explicitly designed to incorporate desired content in output images. The framework utilizes advanced encoding techniques, integrating subnetworks called content fusion and frequency encoding modules. The frequency encoding module first captures features and structures of reference images by exclusively focusing on selected frequency components. Subsequently, the content fusion module generates a content-guiding vector that encapsulates desired content features. During the image generation process, content-guiding vectors from real images are fused with projected noise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Image Processing and 3D Reconstruction · Advanced Image and Video Retrieval Techniques
