Learning to Align Sequential Actions in the Wild

Weizhe Liu; Bugra Tekin; Huseyin Coskun; Vibhav Vineet; Pascal Fua,; Marc Pollefeys

arXiv:2111.09301·cs.CV·November 18, 2021·1 cites

Learning to Align Sequential Actions in the Wild

Weizhe Liu, Bugra Tekin, Huseyin Coskun, Vibhav Vineet, Pascal Fua,, Marc Pollefeys

PDF

Open Access

TL;DR

This paper introduces a novel method for aligning sequential actions in videos that accounts for temporal variations, background frames, and non-monotonic sequences, outperforming existing self-supervised approaches.

Contribution

The proposed approach enforces temporal priors on the optimal transport matrix, enabling alignment of diverse and non-monotonic action sequences in the wild.

Findings

01

Outperforms state-of-the-art on four benchmarks

02

Handles background frames effectively

03

Accounts for non-monotonic action sequences

Abstract

State-of-the-art methods for self-supervised sequential action alignment rely on deep networks that find correspondences across videos in time. They either learn frame-to-frame mapping across sequences, which does not leverage temporal information, or assume monotonic alignment between each video pair, which ignores variations in the order of actions. As such, these methods are not able to deal with common real-world scenarios that involve background frames or videos that contain non-monotonic sequence of actions. In this paper, we propose an approach to align sequential actions in the wild that involve diverse temporal variations. To this end, we propose an approach to enforce temporal priors on the optimal transport matrix, which leverages temporal consistency, while allowing for variations in the order of actions. Our model accounts for both monotonic and non-monotonic sequences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis