Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream
Yue Tang, Yawen Wu, Peipei Zhou, and Jingtong Hu

TL;DR
This paper introduces an on-device weakly-supervised temporal action localization method that learns directly from streaming videos by segmenting and sampling efficiently, reducing annotation effort and adapting to new environments.
Contribution
It proposes a novel self-adaptive video dividing and contrast score-based segment merging approach for on-device streaming video learning in TAL tasks.
Findings
First to learn directly from on-device long video streams
Effective segmentation and sampling strategies reduce labeling effort
Improved adaptability to new environments in TAL models
Abstract
Detecting actions in videos have been widely applied in on-device applications. Practical on-device videos are always untrimmed with both action and background. It is desirable for a model to both recognize the class of action and localize the temporal position where the action happens. Such a task is called temporal action location (TAL), which is always trained on the cloud where multiple untrimmed videos are collected and labeled. It is desirable for a TAL model to continuously and locally learn from new data, which can directly improve the action detection precision while protecting customers' privacy. However, it is non-trivial to train a TAL model, since tremendous video samples with temporal annotations are required. However, annotating videos frame by frame is exorbitantly time-consuming and expensive. Although weakly-supervised TAL (W-TAL) has been proposed to learn from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Video Surveillance and Tracking Methods
