Choose Settings Carefully: Comparing Action Unit detection at Different Settings Using a Large-Scale Dataset
Mina Bishay, Ahmed Ghoneim, Mohamed Ashraf, Mohammad Mavadati

TL;DR
This study systematically examines how different preprocessing and training settings affect Action Unit detection performance using a large-scale, in-the-wild dataset, highlighting the importance of setting choices.
Contribution
It is the first to analyze the impact of various preprocessing and training configurations on AU detection accuracy and complexity.
Findings
Preprocessing choices significantly influence AU detection performance.
Classifier type and training data size affect model accuracy.
Optimal settings vary depending on the specific AU detection task.
Abstract
In this paper, we investigate the impact of some of the commonly used settings for (a) preprocessing face images, and (b) classification and training, on Action Unit (AU) detection performance and complexity. We use in our investigation a large-scale dataset, consisting of ~55K videos collected in the wild for participants watching commercial ads. The preprocessing settings include scaling the face to a fixed resolution, changing the color information (RGB to gray-scale), aligning the face, and cropping AU regions, while the classification and training settings include the kind of classifier (multi-label vs. binary) and the amount of data used for training models. To the best of our knowledge, no work had investigated the effect of those settings on AU detection. In our analysis we use CNNs as our baseline classification model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
