Multi-Order Networks for Action Unit Detection
Gauthier Tallec, Arnaud Dapogny, Kevin Bailly

TL;DR
This paper introduces MONET, a multi-task learning framework that optimizes task order to improve facial Action Unit detection, outperforming existing methods and addressing order-related performance issues.
Contribution
MONET jointly learns task modules with an optimized order using differentiable selection, enhancing multi-task learning for AU detection.
Findings
MONET successfully retrieves optimal task order in toy experiments.
MONET outperforms existing multi-task baselines on attribute detection.
MONET significantly improves state-of-the-art AU detection performance.
Abstract
Action Units (AU) are muscular activations used to describe facial expressions. Therefore accurate AU recognition unlocks unbiaised face representation which can improve face-based affective computing applications. From a learning standpoint AU detection is a multi-task problem with strong inter-task dependencies. To solve such problem, most approaches either rely on weight sharing, or add explicit dependency modelling by decomposing the joint task distribution using Bayes chain rule. If the latter strategy yields comprehensive inter-task relationships modelling, it requires imposing an arbitrary order into an unordered task set. Crucially, this ordering choice has been identified as a source of performance variations. In this paper, we present Multi-Order Network (MONET), a multi-task method with joint task order optimization. MONET uses a differentiable order selection to jointly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face and Expression Recognition · Advanced Computing and Algorithms
MethodsMixture model network · Dropout
