Enabling Weakly-Supervised Temporal Action Localization from On-Device   Learning of the Video Stream

Yue Tang; Yawen Wu; Peipei Zhou; and Jingtong Hu

arXiv:2208.12673·cs.CV·August 29, 2022

Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream

Yue Tang, Yawen Wu, Peipei Zhou, and Jingtong Hu

PDF

Open Access

TL;DR

This paper introduces an on-device weakly-supervised temporal action localization method that learns directly from streaming videos by segmenting and sampling efficiently, reducing annotation effort and adapting to new environments.

Contribution

It proposes a novel self-adaptive video dividing and contrast score-based segment merging approach for on-device streaming video learning in TAL tasks.

Findings

01

First to learn directly from on-device long video streams

02

Effective segmentation and sampling strategies reduce labeling effort

03

Improved adaptability to new environments in TAL models

Abstract

Detecting actions in videos have been widely applied in on-device applications. Practical on-device videos are always untrimmed with both action and background. It is desirable for a model to both recognize the class of action and localize the temporal position where the action happens. Such a task is called temporal action location (TAL), which is always trained on the cloud where multiple untrimmed videos are collected and labeled. It is desirable for a TAL model to continuously and locally learn from new data, which can directly improve the action detection precision while protecting customers' privacy. However, it is non-trivial to train a TAL model, since tremendous video samples with temporal annotations are required. However, annotating videos frame by frame is exorbitantly time-consuming and expensive. Although weakly-supervised TAL (W-TAL) has been proposed to learn from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Video Surveillance and Tracking Methods