Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation
Guodong Ding, Angela Yao

TL;DR
This paper introduces a semi-supervised approach for temporal action segmentation in videos, using novel loss functions and adaptive boundary smoothing to improve performance with limited labeled data.
Contribution
It proposes two new loss functions and an adaptive boundary smoothing method to enhance semi-supervised temporal action segmentation.
Findings
Significant improvement with only 5-10% labeled data
Achieved comparable results to fully supervised methods with 50% labeled data
Enhanced fully-supervised learning performance with ABS
Abstract
We present a semi-supervised learning approach to the temporal action segmentation task. The goal of the task is to temporally detect and segment actions in long, untrimmed procedural videos, where only a small set of videos are densely labelled, and a large collection of videos are unlabelled. To this end, we propose two novel loss functions for the unlabelled data: an action affinity loss and an action continuity loss. The action affinity loss guides the unlabelled samples learning by imposing the action priors induced from the labelled set. Action continuity loss enforces the temporal continuity of actions, which also provides frame-wise classification supervision. In addition, we propose an Adaptive Boundary Smoothing (ABS) approach to build coarser action boundaries for more robust and reliable learning. The proposed loss functions and ABS were evaluated on three benchmarks.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods
