Visual Self-paced Iterative Learning for Unsupervised Temporal Action   Localization

Yupeng Hu; Han Jiang; Hao Liu; Kun Wang; Haoyu Tang; Liqiang Nie

arXiv:2312.07384·cs.CV·April 1, 2025·1 cites

Visual Self-paced Iterative Learning for Unsupervised Temporal Action Localization

Yupeng Hu, Han Jiang, Hao Liu, Kun Wang, Haoyu Tang, Liqiang Nie

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-paced iterative learning approach for unsupervised temporal action localization that improves clustering confidence and pseudolabel reliability, achieving superior results on public datasets.

Contribution

It proposes a novel self-paced iterative learning framework that enhances clustering and pseudolabel accuracy for unsupervised TAL, addressing key limitations of prior methods.

Findings

01

Outperforms state-of-the-art unsupervised TAL methods on public datasets.

02

Improves clustering confidence through contextual feature exploration.

03

Ensures reliable pseudolabels via incremental instance learning strategies.

Abstract

Recently, temporal action localization (TAL) has garnered significant interest in information retrieval community. However, existing supervised/weakly supervised methods are heavily dependent on extensive labeled temporal boundaries and action categories, which is labor-intensive and time-consuming. Although some unsupervised methods have utilized the ``iteratively clustering and localization'' paradigm for TAL, they still suffer from two pivotal impediments: 1) unsatisfactory video clustering confidence, and 2) unreliable video pseudolabels for model training. To address these limitations, we present a novel self-paced iterative learning model to enhance clustering and localization training simultaneously, thereby facilitating more effective unsupervised TAL. Concretely, we improve the clustering confidence through exploring the contextual feature-robust visual information. Thereafter,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tanghaoyu258/feel
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning