Modeling Multi-Label Action Dependencies for Temporal Action   Localization

Praveen Tirupattur; Kevin Duarte; Yogesh Rawat; Mubarak Shah

arXiv:2103.03027·cs.CV·June 1, 2021

Modeling Multi-Label Action Dependencies for Temporal Action Localization

Praveen Tirupattur, Kevin Duarte, Yogesh Rawat, Mubarak Shah

PDF

1 Repo

TL;DR

This paper introduces an attention-based model that captures both co-occurrence and temporal dependencies between actions to improve the accuracy of localizing multiple actions in untrimmed videos.

Contribution

It proposes a novel Multi-Label Action Dependency (MLAD) layer with two branches to explicitly model action dependencies at different time scales.

Findings

01

Improved f-mAP on MultiTHUMOS and Charades datasets.

02

New metrics effectively measure action dependency modeling.

03

State-of-the-art performance achieved.

Abstract

Real-world videos contain many complex actions with inherent relationships between action classes. In this work, we propose an attention-based architecture that models these action relationships for the task of temporal action localization in untrimmed videos. As opposed to previous works that leverage video-level co-occurrence of actions, we distinguish the relationships between actions that occur at the same time-step and actions that occur at different time-steps (i.e. those which precede or follow each other). We define these distinct relationships as action dependencies. We propose to improve action localization performance by modeling these action dependencies in a novel attention-based Multi-Label Action Dependency (MLAD)layer. The MLAD layer consists of two branches: a Co-occurrence Dependency Branch and a Temporal Dependency Branch to model co-occurrence action dependencies and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ptirupat/MLAD
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.