Deep Learning for Human Parsing: A Survey
Xiaomei Zhang, Xiangyu Zhu, Ming Tang, Zhen Lei

TL;DR
This survey reviews deep learning methods for human parsing, categorizing approaches into five groups, analyzing their strengths and weaknesses, and discussing datasets, performance, and future research directions in the field.
Contribution
It provides a comprehensive analysis of state-of-the-art deep learning techniques for human parsing, categorizing methods and highlighting their comparative advantages.
Findings
Structured architectures exploit human body hierarchies.
Graph-based networks capture global information effectively.
Context-aware and LSTM methods enhance pixel-level understanding.
Abstract
Human parsing is a key topic in image processing with many applications, such as surveillance analysis, human-robot interaction, person search, and clothing category classification, among many others. Recently, due to the success of deep learning in computer vision, there are a number of works aimed at developing human parsing algorithms using deep learning models. As methods have been proposed, a comprehensive survey of this topic is of great importance. In this survey, we provide an analysis of state-of-the-art human parsing methods, covering a broad spectrum of pioneering works for semantic human parsing. We introduce five insightful categories: (1) structure-driven architectures exploit the relationship of different human parts and the inherent hierarchical structure of a human body, (2) graph-based networks capture the global information to achieve an efficient and complete human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Advanced Neural Network Applications
