EPIR: An Efficient Patch Tokenization, Integration and Representation Framework for Micro-expression Recognition
Junbo Wang, Liangyu Fu, Yuke Li, Yining Zhu, Xuecheng Wu, Kun Hu

TL;DR
EPIR is a novel framework that enhances micro-expression recognition by balancing high accuracy with low computational cost through innovative tokenization and integration techniques.
Contribution
The paper introduces a dual norm shifted tokenization module, a token integration module, and a discriminative token extractor with dynamic token selection, improving Transformer efficiency.
Findings
Achieves 9.6% UF1 improvement on CAS(ME)$^3$ dataset.
Attains 4.58% UAR improvement on SMIC dataset.
Outperforms state-of-the-art methods on four public datasets.
Abstract
Micro-expression recognition can obtain the real emotion of the individual at the current moment. Although deep learning-based methods, especially Transformer-based methods, have achieved impressive results, these methods have high computational complexity due to the large number of tokens in the multi-head self-attention. In addition, the existing micro-expression datasets are small-scale, which makes it difficult for Transformer-based models to learn effective micro-expression representations. Therefore, we propose a novel Efficient Patch tokenization, Integration and Representation framework (EPIR), which can balance high recognition performance and low computational complexity. Specifically, we first propose a dual norm shifted tokenization (DNSPT) module to learn the spatial relationship between neighboring pixels in the face region, which is implemented by a refined spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
