Leveraging Self-Supervised Training for Unintentional Action Recognition

Enea Duka; Anna Kukleva; Bernt Schiele

arXiv:2209.11870·cs.CV·September 27, 2022

Leveraging Self-Supervised Training for Unintentional Action Recognition

Enea Duka, Anna Kukleva, Bernt Schiele

PDF

Open Access

TL;DR

This paper introduces a multi-stage, self-supervised framework for recognizing unintentional actions in videos by exploiting inherent biases and temporal transformations, significantly advancing the state-of-the-art.

Contribution

It proposes a novel multi-stage approach combined with self-supervised temporal transformations (T2IBUA) to improve unintentional action recognition in videos.

Findings

01

Strong performance improvements over existing methods

02

Effective modeling of temporal information at multiple levels

03

Extensive ablation studies validate the framework's effectiveness

Abstract

Unintentional actions are rare occurrences that are difficult to define precisely and that are highly dependent on the temporal context of the action. In this work, we explore such actions and seek to identify the points in videos where the actions transition from intentional to unintentional. We propose a multi-stage framework that exploits inherent biases such as motion speed, motion direction, and order to recognize unintentional actions. To enhance representations via self-supervised training for the task of unintentional action recognition we propose temporal transformations, called Temporal Transformations of Inherent Biases of Unintentional Actions (T2IBUA). The multi-stage approach models the temporal information on both the level of individual frames and full clips. These enhanced representations show strong performance for unintentional action recognition tasks. We provide an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications