End-to-end One-shot Human Parsing
Haoyu He, Bohan Zhuang, Jing Zhang, Jianfei Cai, Dacheng Tao

TL;DR
This paper introduces a novel one-shot human parsing task that enables flexible parsing of humans into new classes with limited training data, using an end-to-end network that adapts quickly to novel classes and mitigates bias.
Contribution
The paper proposes the EOP-Net, an end-to-end framework for one-shot human parsing that handles small classes, reduces testing bias, and discriminates similar parts, advancing the field of flexible human parsing.
Findings
EOP-Net outperforms existing one-shot segmentation models significantly.
The method effectively adapts to novel classes with limited training data.
The approach reduces testing bias and improves discrimination of similar parts.
Abstract
Previous human parsing methods are limited to parsing humans into pre-defined classes, which is inflexible for practical fashion applications that often have new fashion item classes. In this paper, we define a novel one-shot human parsing (OSHP) task that requires parsing humans into an open set of classes defined by any test example. During training, only base classes are exposed, which only overlap with part of the test-time classes. To address three main challenges in OSHP, i.e., small sizes, testing bias, and similar parts, we devise an End-to-end One-shot human Parsing Network (EOP-Net). Firstly, an end-to-end human parsing framework is proposed to parse the query image into both coarse-grained and fine-grained human classes, which embeds rich semantic information that is shared across different granularities to identify the small-sized human classes. Then, we gradually smooth the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
