TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

Humam Alwassel; Silvio Giancola; Bernard Ghanem

arXiv:2011.11479·cs.CV·August 18, 2021

TL;DR

This paper introduces a supervised pretraining method for video clip features that enhances temporal sensitivity, leading to improved performance in various video localization tasks across multiple architectures and datasets.

Contribution

A novel pretraining paradigm that incorporates background and global video information to produce temporally-sensitive features for localization tasks.
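The idea of combining activity classification with background clips and global video information can be illustrated with a small sketch. This is a minimal, dependency-free illustration under stated assumptions, not the authors' implementation: the two-head layout, the use of an element-wise max over clip features as the global video feature, and the choice to skip the action loss on background clips are all assumptions made here for illustration. Every function name (`max_pool`, `linear`, `cross_entropy`, `two_head_loss`) is hypothetical and does not come from the official repository.

```python
import math
from typing import List, Sequence, Tuple

Vector = List[float]
Head = Tuple[Sequence[Sequence[float]], Sequence[float]]  # (weights, bias)


def max_pool(clip_features: Sequence[Vector]) -> Vector:
    """Element-wise max over all clip features of one video.

    Assumption: this stands in for the "global video information".
    """
    return [max(column) for column in zip(*clip_features)]


def linear(x: Vector, weights: Sequence[Sequence[float]], bias: Sequence[float]) -> Vector:
    """Plain linear layer: one output logit per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, bias)]


def cross_entropy(logits: Vector, target: int) -> float:
    """Numerically stable softmax cross-entropy for a single example."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


def two_head_loss(
    clip_feat: Vector,
    global_feat: Vector,
    action_label: int,
    is_foreground: bool,
    action_head: Head,
    region_head: Head,
) -> float:
    """Loss for one clip from a hypothetical two-head setup.

    The region head sees the local clip feature concatenated with the
    global video feature and predicts background (0) vs. foreground (1).
    The action head sees only the local clip feature; its loss is added
    only for foreground clips (an assumption of this sketch).
    """
    region_logits = linear(clip_feat + global_feat, *region_head)
    loss = cross_entropy(region_logits, 1 if is_foreground else 0)
    if is_foreground:
        action_logits = linear(clip_feat, *action_head)
        loss += cross_entropy(action_logits, action_label)
    return loss


# Toy usage: three 2-dimensional clip features from one untrimmed video.
clips = [[0.2, 1.0], [0.9, 0.1], [0.4, 0.3]]
global_feat = max_pool(clips)

# Zero-initialised heads: 3 action classes, 2 region classes.
action_head = ([[0.0, 0.0]] * 3, [0.0] * 3)
region_head = ([[0.0] * 4] * 2, [0.0] * 2)

fg_loss = two_head_loss(clips[0], global_feat, 2, True, action_head, region_head)
bg_loss = two_head_loss(clips[1], global_feat, 0, False, action_head, region_head)
```

In this sketch a background clip still contributes a training signal through the region head, which is one plausible way an encoder could become sensitive to where an activity starts and ends rather than only to which activity is present.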

Findings

01 Significant performance improvements on three localization tasks: Temporal Action Localization, Action Proposal Generation, and Dense Video Captioning.

02 Effective across multiple encoder architectures.

03 Applicable to different pretraining datasets.

Abstract

Due to the large memory footprint of untrimmed videos, current state-of-the-art video localization methods operate atop precomputed video clip features. These features are extracted from video encoders typically trained for trimmed action classification tasks, making such features not necessarily suitable for temporal localization. In this work, we propose a novel supervised pretraining paradigm for clip features that not only trains to classify activities but also considers background clips and global video information to improve temporal sensitivity. Extensive experiments show that using features trained with our novel pretraining strategy significantly improves the performance of recent state-of-the-art methods on three tasks: Temporal Action Localization, Action Proposal Generation, and Dense Video Captioning. We also show that our pretraining approach is effective across three…

Code & Models

Repositories

HumamAlwassel/TSP (PyTorch, official)
