EditGarment: An Instruction-Based Garment Editing Dataset Constructed with Automated MLLM Synthesis and Semantic-Aware Evaluation

Deqiang Yin; Junyi Guo; Huanda Lu; Fangyu Wu; Dongming Lu

arXiv:2508.03497·cs.CV·August 15, 2025

EditGarment: An Instruction-Based Garment Editing Dataset Constructed with Automated MLLM Synthesis and Semantic-Aware Evaluation

Deqiang Yin, Junyi Guo, Huanda Lu, Fangyu Wu, Dongming Lu

PDF

1 Datasets

TL;DR

This paper introduces EditGarment, a large-scale dataset for instruction-based garment editing, created through automated synthesis guided by fashion-specific semantics and evaluation, to advance fashion image editing research.

Contribution

It presents a novel automated pipeline for constructing a high-quality garment editing dataset with semantic-aware supervision and diverse instructions, addressing data scarcity in fashion editing tasks.

Findings

01

Constructed 20,596 high-quality instruction-image triplets.

02

Defined six real-world fashion instruction categories.

03

Introduced Fashion Edit Score for semantic-aware evaluation.

Abstract

Instruction-based garment editing enables precise image modifications via natural language, with broad applications in fashion design and customization. Unlike general editing tasks, it requires understanding garment-specific semantics and attribute dependencies. However, progress is limited by the scarcity of high-quality instruction-image pairs, as manual annotation is costly and hard to scale. While MLLMs have shown promise in automated data synthesis, their application to garment editing is constrained by imprecise instruction modeling and a lack of fashion-specific supervisory signals. To address these challenges, we present an automated pipeline for constructing a garment editing dataset. We first define six editing instruction categories aligned with real-world fashion workflows to guide the generation of balanced and diverse instruction-image triplets. Second, we introduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

yindq99/EditGarment
dataset· 3 dl
3 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.