W-TALC: Weakly-supervised Temporal Activity Localization and Classification
Sujoy Paul, Sourya Roy, Amit K Roy-Chowdhury

TL;DR
W-TALC is a weakly-supervised framework that localizes and classifies activities in videos using only video-level labels, significantly reducing annotation effort while outperforming existing methods on challenging datasets.
Contribution
The paper introduces W-TALC, a novel weakly-supervised approach combining a Two-Stream feature extractor and a specialized module, enabling effective activity localization with minimal supervision.
Findings
Outperforms state-of-the-art methods on Thumos14 and ActivityNet1.2 datasets.
Effectively localizes activities at fine granularity.
Reduces manual annotation effort in activity recognition.
Abstract
Most activity localization methods in the literature suffer from the burden of frame-wise annotation requirement. Learning from weak labels may be a potential solution towards reducing such manual labeling effort. Recent years have witnessed a substantial influx of tagged videos on the Internet, which can serve as a rich source of weakly-supervised training data. Specifically, the correlations between videos with similar tags can be utilized to temporally localize the activities. Towards this goal, we present W-TALC, a Weakly-supervised Temporal Activity Localization and Classification framework using only video-level labels. The proposed network can be divided into two sub-networks, namely the Two-Stream based feature extractor network and a weakly-supervised module, which we learn by optimizing two complimentary loss functions. Qualitative and quantitative results on two challenging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Context-Aware Activity Recognition Systems · Anomaly Detection Techniques and Applications
