FaceBoxes: A CPU Real-time Face Detector with High Accuracy
Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z., Li

TL;DR
FaceBoxes is a novel CPU-efficient face detector that achieves real-time performance with high accuracy by using a lightweight network structure, multi-scale features, and an anchor densification strategy, outperforming previous methods.
Contribution
The paper introduces FaceBoxes, a face detection model combining RDCL and MSCL for speed and accuracy, with a new anchor densification strategy to improve small face detection.
Findings
Runs at 20 FPS on CPU for VGA images
Achieves state-of-the-art accuracy on face detection benchmarks
Speed is invariant to the number of faces in the image
Abstract
Although tremendous strides have been made in face detection, one of the remaining open challenges is to achieve real-time speed on the CPU as well as maintain high performance, since effective models for face detection tend to be computationally prohibitive. To address this challenge, we propose a novel face detector, named FaceBoxes, with superior performance on both speed and accuracy. Specifically, our method has a lightweight yet powerful network structure that consists of the Rapidly Digested Convolutional Layers (RDCL) and the Multiple Scale Convolutional Layers (MSCL). The RDCL is designed to enable FaceBoxes to achieve real-time speed on the CPU. The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales. Besides, we propose a new anchor densification strategy to make different types of anchors have the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Biometric Identification and Security
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
