DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation,   Segmentation and Re-Identification of Clothing Images

Yuying Ge; Ruimao Zhang; Lingyun Wu; Xiaogang Wang; Xiaoou; Tang; Ping Luo

arXiv:1901.07973·cs.CV·January 24, 2019·25 cites

DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images

Yuying Ge, Ruimao Zhang, Lingyun Wu, Xiaogang Wang, Xiaoou, Tang, Ping Luo

PDF

Open Access 5 Repos

TL;DR

DeepFashion2 introduces a comprehensive benchmark dataset with rich annotations for clothing detection, pose estimation, segmentation, and re-identification, addressing limitations of previous datasets and supporting advanced fashion image understanding.

Contribution

It provides a large, richly annotated dataset and a strong baseline model for multiple fashion image analysis tasks, bridging gaps in real-world scenario representation.

Findings

01

DeepFashion2 contains 801K clothing items with detailed annotations.

02

The dataset includes 873K commercial-consumer clothing pairs.

03

Extensive evaluations demonstrate the effectiveness of the proposed baseline.

Abstract

Understanding fashion images has been advanced by benchmarks with rich annotations such as DeepFashion, whose labels include clothing categories, landmarks, and consumer-commercial image pairs. However, DeepFashion has nonnegligible issues such as single clothing-item per image, sparse landmarks (4~8 only), and no per-pixel masks, making it had significant gap from real-world scenarios. We fill in the gap by presenting DeepFashion2 to address these issues. It is a versatile benchmark of four tasks including clothes detection, pose estimation, segmentation, and retrieval. It has 801K clothing items where each item has rich annotations such as style, scale, viewpoint, occlusion, bounding box, dense landmarks and masks. There are also 873K Commercial-Consumer clothes pairs. A strong baseline is proposed, called Match R-CNN, which builds upon Mask R-CNN to solve the above four tasks in an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Face recognition and analysis

MethodsRegion Proposal Network · Softmax · Convolution · RoIAlign · Mask R-CNN