Weak Supervision with Arbitrary Single Frame for Micro- and   Macro-expression Spotting

Wang-Wang Yu; Xian-Shi Zhang; Fu-Ya Luo; Yijun Cao; Kai-Fu Yang,; Hong-Mei Yan; and Yong-Jie Li

arXiv:2403.14240·cs.CV·March 22, 2024·1 cites

Weak Supervision with Arbitrary Single Frame for Micro- and Macro-expression Spotting

Wang-Wang Yu, Xian-Shi Zhang, Fu-Ya Luo, Yijun Cao, Kai-Fu Yang,, Hong-Mei Yan, and Yong-Jie Li

PDF

Open Access

TL;DR

This paper introduces a weakly-supervised framework for micro- and macro-expression spotting that requires only a single annotated frame per expression, significantly reducing annotation effort while maintaining high accuracy.

Contribution

The proposed PWES framework employs novel pseudo-labeling and contrastive learning strategies to improve localization accuracy with minimal supervision.

Findings

01

Achieves performance comparable to fully-supervised methods

02

Effective pseudo-label generation improves localization accuracy

03

Global feature representation enhances model robustness

Abstract

Frame-level micro- and macro-expression spotting methods require time-consuming frame-by-frame observation during annotation. Meanwhile, video-level spotting lacks sufficient information about the location and number of expressions during training, resulting in significantly inferior performance compared with fully-supervised spotting. To bridge this gap, we propose a point-level weakly-supervised expression spotting (PWES) framework, where each expression requires to be annotated with only one random frame (i.e., a point). To mitigate the issue of sparse label distribution, the prevailing solution is pseudo-label mining, which, however, introduces new problems: localizing contextual background snippets results in inaccurate boundaries and discarding foreground snippets leads to fragmentary predictions. Therefore, we design the strategies of multi-refined pseudo label generation (MPLG)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Advanced Vision and Imaging

MethodsContrastive Learning