FilterPrompt: A Simple yet Efficient Approach to Guide Image Appearance Transfer in Diffusion Models
Xi Wang, Yichen Peng, Heng Fang, Yilin Wang, Haoran Xie, Xi Yang,, Chuntao Li

TL;DR
FilterPrompt introduces a universal method to improve controllable image generation in diffusion models by manipulating pixel-space features, enabling more precise and flexible control over generated image attributes.
Contribution
The paper presents FilterPrompt, a novel pixel-space operation that enhances controllability in diffusion models, addressing limitations of feature-space disentanglement methods.
Findings
Optimizes feature correlation in generated images
Reduces content conflicts during diffusion process
Improves control over specific image features
Abstract
In controllable generation tasks, flexibly manipulating the generated images to attain a desired appearance or structure based on a single input image cue remains a critical and longstanding challenge. Achieving this requires the effective decoupling of key attributes within the input image data to achieve representations accurately. Previous works have concentrated predominantly on disentangling image attributes within feature space. However, the complex distribution present in real-world data often makes the application of such decoupling algorithms to other datasets challenging. Moreover, the granularity of control over feature encoding frequently fails to meet specific task requirements. Upon scrutinizing the characteristics of various generative models, we have observed that the input sensitivity and dynamic evolution properties of the diffusion model can be effectively fused with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods
MethodsDiffusion
