AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models
Die Chen, Zhongjie Duan, Zhiwen Li, Cen Chen, Daoyuan Chen, Yaliang Li, Yingda Chen

TL;DR
AttriCtrl introduces a lightweight, plug-and-play framework that enables precise, continuous control over multiple aesthetic attributes in diffusion models, enhancing personalization and diversity without retraining the entire model.
Contribution
It proposes a novel hybrid strategy for quantifying aesthetic attributes and a plug-and-play encoder for continuous control, addressing the limitations of existing discrete or global preference-based methods.
Findings
Achieves accurate, continuous control over multiple aesthetic attributes.
Enhances personalization and diversity in image generation.
Maintains model performance with negligible computational overhead.
Abstract
Diffusion models have recently become the dominant paradigm for image generation, yet existing systems struggle to interpret and follow numeric instructions for adjusting semantic attributes. In real-world creative scenarios, especially when precise control over aesthetic attributes is required, current methods fail to provide such controllability. This limitation partly arises from the subjective and context-dependent nature of aesthetic judgments, but more fundamentally stems from the fact that current text encoders are designed for discrete tokens rather than continuous values. Meanwhile, efforts on aesthetic alignment, often leveraging reinforcement learning, direct preference optimization, or architectural modifications, primarily align models with a global notion of human preference. While these approaches improve user experience, they overlook the multifaceted and compositional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Aesthetic Perception and Analysis · Generative Adversarial Networks and Image Synthesis
