FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention

Liangyu Fu; Xuecheng Wu; Danlei Huang; Xinyi Yin

arXiv:2508.13483·cs.CV·August 20, 2025

FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention

Liangyu Fu, Xuecheng Wu, Danlei Huang, Xinyi Yin

PDF

TL;DR

FAMNet is a novel multi-task learning framework that combines 2D and 3D CNNs with hierarchical attention to improve micro-expression recognition by effectively capturing fine-grained spatiotemporal features.

Contribution

This paper introduces FAMNet, a new fusion model using multi-task learning and hierarchical attention to enhance micro-expression recognition performance.

Findings

01

Achieves 83.75% UAR on SAMM dataset

02

Outperforms existing methods on CASME II and MMEW datasets

03

Significantly improves recognition on challenging CAS(ME)$^3$ dataset

Abstract

Micro-expressions recognition (MER) has essential application value in many fields, but the short duration and low intensity of micro-expressions (MEs) bring considerable challenges to MER. The current MER methods in deep learning mainly include three data loading methods: static images, dynamic image sequence, and a combination of the two streams. How to effectively extract MEs' fine-grained and spatiotemporal features has been difficult to solve. This paper proposes a new MER method based on multi-task learning and hierarchical attention, which fully extracts MEs' omni-directional features by merging 2D and 3D CNNs. The fusion model consists of a 2D CNN AMNet2D and a 3D CNN AMNet3D, with similar structures consisting of a shared backbone network Resnet18 and attention modules. During training, the model adopts different data loading methods to adapt to two specific networks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.