InstructAttribute: Fine-grained Object Attributes editing with Instruction
Xingxi Yin, Jingfeng Zhang, Yue Deng, Zhi Li, Yicheng Li, Yin Zhang

TL;DR
InstructAttribute is a novel instruction-tuned model that enables precise, fine-grained editing of object attributes like color and material in images, leveraging a new training-free framework and large language models for data curation.
Contribution
The paper introduces InstructAttribute, a new method for object attribute editing that combines a training-free framework with large language models for data generation, improving accuracy and structural preservation.
Findings
Outperforms existing instruction-based methods in attribute editing accuracy
Achieves a good balance between attribute modification and image structural integrity
Enables practical applications in product design, e-commerce, and virtual try-on
Abstract
Text-to-image (T2I) diffusion models are widely used in image editing due to their powerful generative capabilities. However, achieving fine-grained control over specific object attributes, such as color and material, remains a considerable challenge. Existing methods often fail to accurately modify these attributes or compromise structural integrity and overall image consistency. To fill this gap, we introduce Structure Preservation and Attribute Amplification (SPAA), a novel training-free framework that enables precise generation of color and material attributes for the same object by intelligently manipulating self-attention maps and cross-attention values within diffusion models. Building on SPAA, we integrate multi-modal large language models (MLLMs) to automate data curation and instruction generation. Leveraging this object attribute data collection engine, we construct the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Modular Robots and Swarm Intelligence · Robot Manipulation and Learning
