Towards Dense People Detection with Deep Learning and Depth images

David Fuentes-Jimenez; Cristina Losada-Gutierrez; David; Casillas-Perez; Javier Macias-Guarasa; Roberto Martin-Lopez; Daniel; Pizarro; Carlos A.Luna

arXiv:2007.07171·cs.CV·July 15, 2020

Towards Dense People Detection with Deep Learning and Depth images

David Fuentes-Jimenez, Cristina Losada-Gutierrez, David, Casillas-Perez, Javier Macias-Guarasa, Roberto Martin-Lopez, Daniel, Pizarro, Carlos A.Luna

PDF

TL;DR

This paper introduces a deep neural network system that detects multiple people in a scene using depth images, providing real-time performance and robustness to occlusions, with effective training strategies and superior accuracy.

Contribution

The paper presents a novel compact DNN architecture for dense people detection from depth images, combining simulated and real data training for improved generalization.

Findings

01

Outperforms existing state-of-the-art methods

02

Operates in real-time on low-budget GPUs

03

Accurately detects people even with occlusions

Abstract

This paper proposes a DNN-based system that detects multiple people from a single depth image. Our neural network processes a depth image and outputs a likelihood map in image coordinates, where each detection corresponds to a Gaussian-shaped local distribution, centered at the person's head. The likelihood map encodes both the number of detected people and their 2D image positions, and can be used to recover the 3D position of each person using the depth image and the camera calibration parameters. Our architecture is compact, using separated convolutions to increase performance, and runs in real-time with low budget GPUs. We use simulated data for initially training the network, followed by fine tuning with a relatively small amount of real data. We show this strategy to be effective, producing networks that generalize to work with scenes different from those used during training. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.