Knowing When to Quit: Selective Cascaded Regression with Patch Attention for Real-Time Face Alignment

Gil Shapira; Noga Levy; Ishay Goldin; Roy J. Jevnisek

arXiv:2108.00377·cs.CV·August 4, 2021

TL;DR

This paper introduces a real-time face alignment method that adaptively stops iterations based on predicted error, uses patch attention for efficiency, and achieves high accuracy on challenging datasets with low computational cost.

Contribution

It proposes a selective cascaded regression framework with patch attention, enabling faster and more accurate face alignment by early stopping and patch-based inference.

Findings

01. Achieves real-time performance on mobile devices.

02. Outperforms state-of-the-art methods that use fewer than 1000 MMA (mega multiply-add) operations.

03. Reaches a normalized mean error (NME) of 8.16 on 300W.
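For readers unfamiliar with the metric, normalized mean error is commonly computed as the average Euclidean distance between predicted and ground-truth landmarks, divided by a reference distance on the face. The sketch below is a minimal illustration of that common definition, not the paper's evaluation code. The function name, the `norm_dist` argument (for example an inter-ocular distance), and the percent scaling are assumptions, since this page does not state which normalization the reported figure uses.

```python
import numpy as np

def normalized_mean_error(pred, gt, norm_dist):
    """Mean per-landmark Euclidean error over a normalising distance, in percent.

    pred, gt: (num_landmarks, 2) arrays of (x, y) coordinates.
    norm_dist: reference distance; which one to use is an assumption here.
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)
    # Euclidean distance between each predicted and ground-truth landmark
    per_landmark = np.linalg.norm(pred - gt, axis=1)
    return 100.0 * per_landmark.mean() / norm_dist
```

For two landmarks with errors of 0 and 5 pixels and a reference distance of 10 pixels, this gives a mean error of 2.5 pixels, so the function returns 25.0.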

Abstract

Facial landmarks (FLM) estimation is a critical component in many face-related applications. In this work, we aim to optimize for both accuracy and speed and explore the trade-off between them. Our key observation is that not all faces are created equal. Frontal faces with neutral expressions converge faster than faces with extreme poses or expressions. To differentiate among samples, we train our model to predict the regression error after each iteration. If the current iteration is accurate enough, we stop iterating, saving redundant iterations while keeping the accuracy in check. We also observe that as neighboring patches overlap, we can infer all facial landmarks (FLMs) with only a small number of patches without a major accuracy sacrifice. Architecturally, we offer a multi-scale, patch-based, lightweight feature extractor with a fine-grained local patch attention module, which…
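The early-stopping idea in the abstract can be sketched as a loop over cascade stages that exits once the predicted error is low enough. This is a minimal illustration under stated assumptions, not the authors' implementation: `selective_cascade`, `make_toy_stage`, and `error_threshold` are hypothetical names, and the toy stage computes the true remaining error where a trained model would predict it.

```python
import numpy as np

def selective_cascade(stages, init_landmarks, error_threshold):
    """Apply cascade stages until a stage predicts an error at or below the threshold.

    stages: list of callables; each maps the current landmarks to
            (landmark_update, predicted_error). Hypothetical interface.
    Returns the final landmarks and the number of stages actually run.
    """
    landmarks = np.asarray(init_landmarks, dtype=float)
    used = 0
    for stage in stages:
        delta, predicted_error = stage(landmarks)
        landmarks = landmarks + delta
        used += 1
        if predicted_error <= error_threshold:
            break  # accurate enough: skip the remaining stages
    return landmarks, used

def make_toy_stage(target):
    """Toy stage that moves landmarks halfway toward a known target."""
    target = np.asarray(target, dtype=float)

    def stage(landmarks):
        delta = 0.5 * (target - landmarks)
        # Stand-in for a learned error predictor: the true mean remaining error
        remaining = target - (landmarks + delta)
        predicted_error = float(np.linalg.norm(remaining, axis=1).mean())
        return delta, predicted_error

    return stage
```

In this sketch an easy sample reaches the threshold after a few stages while a harder one runs the full cascade, which is the accuracy and speed trade-off the abstract describes.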

Code & Models

Repositories

ligaripash/KWtQ-face-alignment (TensorFlow, official)
