Leveraging Action Affinity and Continuity for Semi-supervised Temporal   Action Segmentation

Guodong Ding; Angela Yao

arXiv:2207.08653·cs.CV·July 22, 2022·1 cites

Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation

Guodong Ding, Angela Yao

PDF

Open Access

TL;DR

This paper introduces a semi-supervised approach for temporal action segmentation in videos, using novel loss functions and adaptive boundary smoothing to improve performance with limited labeled data.

Contribution

It proposes two new loss functions and an adaptive boundary smoothing method to enhance semi-supervised temporal action segmentation.

Findings

01

Significant improvement with only 5-10% labeled data

02

Achieved comparable results to fully supervised methods with 50% labeled data

03

Enhanced fully-supervised learning performance with ABS

Abstract

We present a semi-supervised learning approach to the temporal action segmentation task. The goal of the task is to temporally detect and segment actions in long, untrimmed procedural videos, where only a small set of videos are densely labelled, and a large collection of videos are unlabelled. To this end, we propose two novel loss functions for the unlabelled data: an action affinity loss and an action continuity loss. The action affinity loss guides the unlabelled samples learning by imposing the action priors induced from the labelled set. Action continuity loss enforces the temporal continuity of actions, which also provides frame-wise classification supervision. In addition, we propose an Adaptive Boundary Smoothing (ABS) approach to build coarser action boundaries for more robust and reliable learning. The proposed loss functions and ABS were evaluated on three benchmarks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods