ME-TST+: Micro-expression Analysis via Temporal State Transition with ROI Relationship Awareness
Zizheng Guo, Bochao Zou, Junbao Zhuo, Huimin Ma

TL;DR
This paper introduces ME-TST+ architectures that leverage temporal state transitions and ROI relationships for more accurate micro-expression analysis, outperforming traditional sliding-window methods.
Contribution
The paper proposes novel state space model-based architectures for joint micro-expression spotting and recognition, incorporating ROI relationship awareness and multi-granularity modeling.
Findings
Achieves state-of-the-art performance on micro-expression datasets.
Effectively models variable-duration micro-expressions with temporal state transitions.
Enhances analysis accuracy through feature and result-level synergy strategies.
Abstract
Micro-expressions (MEs) are regarded as important indicators of an individual's intrinsic emotions, preferences, and tendencies. ME analysis requires spotting of ME intervals within long video sequences and recognition of their corresponding emotional categories. Previous deep learning approaches commonly employ sliding-window classification networks. However, the use of fixed window lengths and hard classification presents notable limitations in practice. Furthermore, these methods typically treat ME spotting and recognition as two separate tasks, overlooking the essential relationship between them. To address these challenges, this paper proposes two state space model-based architectures, namely ME-TST and ME-TST+, which utilize temporal state transition mechanisms to replace conventional window-level classification with video-level regression. This enables a more precise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Time Series Analysis and Forecasting · Human Pose and Action Recognition
