# Detector-in-Detector: Multi-Level Analysis for Human-Parts

**Authors:** Xiaojie Li, Lu Yang, Qing Song, Fuqiang Zhou

arXiv: 1902.07017 · 2019-02-20

## TL;DR

This paper introduces DID-Net, a multi-level detection framework that leverages the correlation between human bodies and parts to improve detection accuracy, trained on a new large dataset called Human-Parts.

## Contribution

We propose a novel Detector-in-Detector network that detects humans and their parts in a coarse-to-fine manner, and introduce a new large dataset for human-part detection.

## Key findings

- Achieves high detection performance on the Human-Parts dataset.
- Effectively detects small hands and faces within human bodies.
- End-to-end training improves overall detection accuracy.

## Abstract

Vision-based person, hand or face detection approaches have achieved incredible success in recent years with the development of deep convolutional neural network (CNN). In this paper, we take the inherent correlation between the body and body parts into account and propose a new framework to boost up the detection performance of the multi-level objects. In particular, we adopt a region-based object detection structure with two carefully designed detectors to separately pay attention to the human body and body parts in a coarse-to-fine manner, which we call Detector-in-Detector network (DID-Net). The first detector is designed to detect human body, hand, and face. The second detector, based on the body detection results of the first detector, mainly focus on the detection of small hand and face inside each body. The framework is trained in an end-to-end way by optimizing a multi-task loss. Due to the lack of human body, face and hand detection dataset, we have collected and labeled a new large dataset named Human-Parts with 14,962 images and 106,879 annotations. Experiments show that our method can achieve excellent performance on Human-Parts.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.07017/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1902.07017/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1902.07017/full.md

---
Source: https://tomesphere.com/paper/1902.07017