DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial   Action Coding

Dieu Linh Tran; Robert Walecki; Ognjen Rudovic; Stefanos; Eleftheriadis; Bj{\o}rn Schuller; Maja Pantic

arXiv:1704.02206·cs.CV·August 8, 2017·2 cites

DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial Action Coding

Dieu Linh Tran, Robert Walecki, Ognjen Rudovic, Stefanos, Eleftheriadis, Bj{\o}rn Schuller, Maja Pantic

PDF

Open Access

TL;DR

DeepCoder introduces a semi-parametric VAE framework combining convolutional and Gaussian Process models to improve facial action unit intensity estimation, outperforming existing methods on benchmark datasets.

Contribution

It presents a novel semi-parametric VAE model that jointly learns hierarchical facial features and ordinal AU intensity classification, integrating parametric and non-parametric approaches.

Findings

01

DeepCoder outperforms state-of-the-art AU intensity estimation methods.

02

The model effectively captures hierarchical facial features.

03

It demonstrates robustness and improved accuracy on benchmark datasets.

Abstract

Human face exhibits an inherent hierarchy in its representations (i.e., holistic facial expressions can be encoded via a set of facial action units (AUs) and their intensity). Variational (deep) auto-encoders (VAE) have shown great results in unsupervised extraction of hierarchical latent representations from large amounts of image data, while being robust to noise and other undesired artifacts. Potentially, this makes VAEs a suitable approach for learning facial features for AU intensity estimation. Yet, most existing VAE-based methods apply classifiers learned separately from the encoded features. By contrast, the non-parametric (probabilistic) approaches, such as Gaussian Processes (GPs), typically outperform their parametric counterparts, but cannot deal easily with large amounts of data. To this end, we propose a novel VAE semi-parametric modeling framework, named DeepCoder, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Emotion and Mood Recognition

MethodsUSD Coin Customer Service Number +1-833-534-1729