Weak Supervision with Arbitrary Single Frame for Micro- and Macro-expression Spotting
Wang-Wang Yu, Xian-Shi Zhang, Fu-Ya Luo, Yijun Cao, Kai-Fu Yang,, Hong-Mei Yan, and Yong-Jie Li

TL;DR
This paper introduces a weakly-supervised framework for micro- and macro-expression spotting that requires only a single annotated frame per expression, significantly reducing annotation effort while maintaining high accuracy.
Contribution
The proposed PWES framework employs novel pseudo-labeling and contrastive learning strategies to improve localization accuracy with minimal supervision.
Findings
Achieves performance comparable to fully-supervised methods
Effective pseudo-label generation improves localization accuracy
Global feature representation enhances model robustness
Abstract
Frame-level micro- and macro-expression spotting methods require time-consuming frame-by-frame observation during annotation. Meanwhile, video-level spotting lacks sufficient information about the location and number of expressions during training, resulting in significantly inferior performance compared with fully-supervised spotting. To bridge this gap, we propose a point-level weakly-supervised expression spotting (PWES) framework, where each expression requires to be annotated with only one random frame (i.e., a point). To mitigate the issue of sparse label distribution, the prevailing solution is pseudo-label mining, which, however, introduces new problems: localizing contextual background snippets results in inaccurate boundaries and discarding foreground snippets leads to fragmentary predictions. Therefore, we design the strategies of multi-refined pseudo label generation (MPLG)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Advanced Vision and Imaging
MethodsContrastive Learning
