Weakly Supervised Action Labeling in Videos Under Ordering Constraints

Piotr Bojanowski; R\'emi Lajugie; Francis Bach; Ivan Laptev; Jean; Ponce; Cordelia Schmid; Josef Sivic

arXiv:1407.1208·cs.CV·July 7, 2014·45 cites

Weakly Supervised Action Labeling in Videos Under Ordering Constraints

Piotr Bojanowski, R\'emi Lajugie, Francis Bach, Ivan Laptev, Jean, Ponce, Cordelia Schmid, Josef Sivic

PDF

Open Access

TL;DR

This paper introduces a weakly supervised method for localizing actions in videos using ordering constraints from text annotations, jointly learning classifiers and temporal labels.

Contribution

It formulates action localization as a weakly supervised temporal assignment problem with ordering constraints, enabling joint learning of classifiers and temporal labels.

Findings

01

Effective localization of actions in videos with minimal supervision

02

Successful application to a large dataset of Hollywood movies

03

Joint learning improves classifier discriminability

Abstract

We are given a set of video clips, each one annotated with an {\em ordered} list of actions, such as "walk" then "sit" then "answer phone" extracted from, for example, the associated text script. We seek to temporally localize the individual actions in each clip as well as to learn a discriminative classifier for each action. We formulate the problem as a weakly supervised temporal assignment with ordering constraints. Each video clip is divided into small time intervals and each time interval of each video clip is assigned one action label, while respecting the order in which the action labels appear in the given annotations. We show that the action label assignment can be determined together with learning a classifier for each action in a discriminative manner. We evaluate the proposed model on a new and challenging dataset of 937 video clips with a total of 787720 frames containing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Analysis and Summarization · Human Motion and Animation