InstructAttribute: Fine-grained Object Attributes editing with Instruction

Xingxi Yin; Jingfeng Zhang; Yue Deng; Zhi Li; Yicheng Li; Yin Zhang

arXiv:2505.00751·cs.CV·June 24, 2025

InstructAttribute: Fine-grained Object Attributes editing with Instruction

Xingxi Yin, Jingfeng Zhang, Yue Deng, Zhi Li, Yicheng Li, Yin Zhang

PDF

Open Access

TL;DR

InstructAttribute is a novel instruction-tuned model that enables precise, fine-grained editing of object attributes like color and material in images, leveraging a new training-free framework and large language models for data curation.

Contribution

The paper introduces InstructAttribute, a new method for object attribute editing that combines a training-free framework with large language models for data generation, improving accuracy and structural preservation.

Findings

01

Outperforms existing instruction-based methods in attribute editing accuracy

02

Achieves a good balance between attribute modification and image structural integrity

03

Enables practical applications in product design, e-commerce, and virtual try-on

Abstract

Text-to-image (T2I) diffusion models are widely used in image editing due to their powerful generative capabilities. However, achieving fine-grained control over specific object attributes, such as color and material, remains a considerable challenge. Existing methods often fail to accurately modify these attributes or compromise structural integrity and overall image consistency. To fill this gap, we introduce Structure Preservation and Attribute Amplification (SPAA), a novel training-free framework that enables precise generation of color and material attributes for the same object by intelligently manipulating self-attention maps and cross-attention values within diffusion models. Building on SPAA, we integrate multi-modal large language models (MLLMs) to automate data curation and instruction generation. Leveraging this object attribute data collection engine, we construct the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Modular Robots and Swarm Intelligence · Robot Manipulation and Learning