EPIR: An Efficient Patch Tokenization, Integration and Representation Framework for Micro-expression Recognition

Junbo Wang; Liangyu Fu; Yuke Li; Yining Zhu; Xuecheng Wu; Kun Hu

arXiv:2604.08106·cs.CV·April 10, 2026

EPIR: An Efficient Patch Tokenization, Integration and Representation Framework for Micro-expression Recognition

Junbo Wang, Liangyu Fu, Yuke Li, Yining Zhu, Xuecheng Wu, Kun Hu

PDF

TL;DR

EPIR is a novel framework that enhances micro-expression recognition by balancing high accuracy with low computational cost through innovative tokenization and integration techniques.

Contribution

The paper introduces a dual norm shifted tokenization module, a token integration module, and a discriminative token extractor with dynamic token selection, improving Transformer efficiency.

Findings

01

Achieves 9.6% UF1 improvement on CAS(ME)$^3$ dataset.

02

Attains 4.58% UAR improvement on SMIC dataset.

03

Outperforms state-of-the-art methods on four public datasets.

Abstract

Micro-expression recognition can obtain the real emotion of the individual at the current moment. Although deep learning-based methods, especially Transformer-based methods, have achieved impressive results, these methods have high computational complexity due to the large number of tokens in the multi-head self-attention. In addition, the existing micro-expression datasets are small-scale, which makes it difficult for Transformer-based models to learn effective micro-expression representations. Therefore, we propose a novel Efficient Patch tokenization, Integration and Representation framework (EPIR), which can balance high recognition performance and low computational complexity. Specifically, we first propose a dual norm shifted tokenization (DNSPT) module to learn the spatial relationship between neighboring pixels in the face region, which is implemented by a refined spatial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.