# Multi-View Dynamic Facial Action Unit Detection

**Authors:** Andres Romero, Juan Leon, Pablo Arbelaez

arXiv: 1704.07863 · 2018-08-21

## TL;DR

This paper introduces a multi-view CNN-based method for dynamic facial action unit detection, achieving significant improvements over baseline and state-of-the-art methods in the FERA 2017 Challenge.

## Contribution

It presents a novel multi-view extension of CNNs for facial action unit detection, incorporating viewpoint prediction and ensemble classifiers for improved accuracy.

## Key findings

- Outperforms FERA 2017 Challenge baseline by 14% F1-metric
- Effectively incorporates multi-view information for facial analysis
- Provides a modular system easily extendable with new action units

## Abstract

We propose a novel convolutional neural network approach to address the fine-grained recognition problem of multi-view dynamic facial action unit detection. We leverage recent gains in large-scale object recognition by formulating the task of predicting the presence or absence of a specific action unit in a still image of a human face as holistic classification. We then explore the design space of our approach by considering both shared and independent representations for separate action units, and also different CNN architectures for combining color and motion information. We then move to the novel setup of the FERA 2017 Challenge, in which we propose a multi-view extension of our approach that operates by first predicting the viewpoint from which the video was taken, and then evaluating an ensemble of action unit detectors that were trained for that specific viewpoint. Our approach is holistic, efficient, and modular, since new action units can be easily included in the overall system. Our approach significantly outperforms the baseline of the FERA 2017 Challenge, with an absolute improvement of 14% on the F1-metric. Additionally, it compares favorably against the winner of the FERA 2017 challenge. Code source is available at https://github.com/BCV-Uniandes/AUNets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.07863/full.md

## Figures

36 figures with captions in the complete paper: https://tomesphere.com/paper/1704.07863/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/1704.07863/full.md

---
Source: https://tomesphere.com/paper/1704.07863