FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention
Liangyu Fu, Xuecheng Wu, Danlei Huang, Xinyi Yin

TL;DR
FAMNet is a novel multi-task learning framework that combines 2D and 3D CNNs with hierarchical attention to improve micro-expression recognition by effectively capturing fine-grained spatiotemporal features.
Contribution
This paper introduces FAMNet, a new fusion model using multi-task learning and hierarchical attention to enhance micro-expression recognition performance.
Findings
Achieves 83.75% UAR on SAMM dataset
Outperforms existing methods on CASME II and MMEW datasets
Significantly improves recognition on challenging CAS(ME)$^3$ dataset
Abstract
Micro-expressions recognition (MER) has essential application value in many fields, but the short duration and low intensity of micro-expressions (MEs) bring considerable challenges to MER. The current MER methods in deep learning mainly include three data loading methods: static images, dynamic image sequence, and a combination of the two streams. How to effectively extract MEs' fine-grained and spatiotemporal features has been difficult to solve. This paper proposes a new MER method based on multi-task learning and hierarchical attention, which fully extracts MEs' omni-directional features by merging 2D and 3D CNNs. The fusion model consists of a 2D CNN AMNet2D and a 3D CNN AMNet3D, with similar structures consisting of a shared backbone network Resnet18 and attention modules. During training, the model adopts different data loading methods to adapt to two specific networks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
