Brute-Force Facial Landmark Analysis With A 140,000-Way Classifier
Mengtian Li, Laszlo Jeni, Deva Ramanan

TL;DR
This paper introduces a novel facial landmark estimation method that formulates the problem as a large-scale classification task with 140,000 classes, enabling rich uncertainty and context modeling, and achieves state-of-the-art results.
Contribution
The work pioneers using an extremely large classification approach for facial landmark analysis, surpassing traditional regression methods and enabling advanced probabilistic reasoning.
Findings
Achieves state-of-the-art facial alignment accuracy in videos.
Demonstrates effective training with 140,000 classes, including many similar fine-grained classes.
Provides a flexible probabilistic framework for landmark uncertainty and context integration.
Abstract
We propose a simple approach to visual alignment, focusing on the illustrative task of facial landmark estimation. While most prior work treats this as a regression problem, we instead formulate it as a discrete -way classification task, where a classifier is trained to return one of discrete alignments. One crucial benefit of a classifier is the ability to report back a (softmax) distribution over putative alignments. We demonstrate that this distribution is a rich representation that can be marginalized (to generate uncertainty estimates over groups of landmarks) and conditioned on (to incorporate top-down context, provided by temporal constraints in a video stream or an interactive human user). Such capabilities are difficult to integrate into classic regression-based approaches. We study performance as a function of the number of classes , including the extreme "exemplar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Orthodontics and Dentofacial Orthopedics
