Temporal Action Segmentation from Timestamp Supervision
Zhe Li, Yazan Abu Farha, Juergen Gall

TL;DR
This paper introduces timestamp supervision for temporal action segmentation, offering a cost-effective alternative to full annotations and achieving performance comparable to fully supervised methods.
Contribution
It proposes a novel training approach using only timestamps, including a confidence loss to improve learning of all action frames.
Findings
Models trained with timestamps perform similarly to fully supervised models.
Timestamp supervision reduces annotation effort compared to full frame-wise labels.
The approach effectively detects action changes using only sparse timestamp annotations.
Abstract
Temporal action segmentation approaches have been very successful recently. However, annotating videos with frame-wise labels to train such models is very expensive and time consuming. While weakly supervised methods trained using only ordered action lists require less annotation effort, the performance is still worse than fully supervised approaches. In this paper, we propose to use timestamp supervision for the temporal action segmentation task. Timestamps require a comparable annotation effort to weakly supervised approaches, and yet provide a more supervisory signal. To demonstrate the effectiveness of timestamp supervision, we propose an approach to train a segmentation model using only timestamps annotations. Our approach uses the model output and the annotated timestamps to generate frame-wise labels by detecting the action changes. We further introduce a confidence loss that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Anomaly Detection Techniques and Applications
