Fine-grained Apparel Classification and Retrieval without rich annotations
Aniket Bhatnagar, Sanchit Aggarwal

TL;DR
This paper introduces a robust apparel classification and retrieval framework that operates without requiring detailed annotations like bounding boxes or landmarks, using compact bilinear CNNs and triplet loss for improved accuracy across domains.
Contribution
The proposed method eliminates the need for rich annotations and bounding box detectors, enabling direct training on category labels and triplets for apparel classification and retrieval.
Findings
Achieves competitive accuracy on DeepFashion datasets.
Effective in cross-domain retrieval with varied image conditions.
Reduces annotation dependency compared to previous methods.
Abstract
The ability to correctly classify and retrieve apparel images has a variety of applications important to e-commerce, online advertising and internet search. In this work, we propose a robust framework for fine-grained apparel classification, in-shop and cross-domain retrieval which eliminates the requirement of rich annotations like bounding boxes and human-joints or clothing landmarks, and training of bounding box/ key-landmark detector for the same. Factors such as subtle appearance differences, variations in human poses, different shooting angles, apparel deformations, and self-occlusion add to the challenges in classification and retrieval of apparel items. Cross-domain retrieval is even harder due to the presence of large variation between online shopping images, usually taken in ideal lighting, pose, positive angle and clean background as compared with street photos captured by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face recognition and analysis · Advanced Neural Network Applications
MethodsTriplet Loss
