FaceBoxes: A CPU Real-time Face Detector with High Accuracy

Shifeng Zhang; Xiangyu Zhu; Zhen Lei; Hailin Shi; Xiaobo Wang; Stan Z.; Li

arXiv:1708.05234·cs.CV·December 27, 2018·40 cites

FaceBoxes: A CPU Real-time Face Detector with High Accuracy

Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z., Li

PDF

Open Access 5 Repos

TL;DR

FaceBoxes is a novel CPU-efficient face detector that achieves real-time performance with high accuracy by using a lightweight network structure, multi-scale features, and an anchor densification strategy, outperforming previous methods.

Contribution

The paper introduces FaceBoxes, a face detection model combining RDCL and MSCL for speed and accuracy, with a new anchor densification strategy to improve small face detection.

Findings

01

Runs at 20 FPS on CPU for VGA images

02

Achieves state-of-the-art accuracy on face detection benchmarks

03

Speed is invariant to the number of faces in the image

Abstract

Although tremendous strides have been made in face detection, one of the remaining open challenges is to achieve real-time speed on the CPU as well as maintain high performance, since effective models for face detection tend to be computationally prohibitive. To address this challenge, we propose a novel face detector, named FaceBoxes, with superior performance on both speed and accuracy. Specifically, our method has a lightweight yet powerful network structure that consists of the Rapidly Digested Convolutional Layers (RDCL) and the Multiple Scale Convolutional Layers (MSCL). The RDCL is designed to enable FaceBoxes to achieve real-time speed on the CPU. The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales. Besides, we propose a new anchor densification strategy to make different types of anchors have the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Face and Expression Recognition · Biometric Identification and Security

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings