Improving Temporal Action Segmentation via Constraint-Aware Decoding

Yeo Keat Ee; Debaditya Roy; Chen Li; Hao Zhang; Basura Fernando

arXiv:2605.10149·cs.CV·May 12, 2026

Improving Temporal Action Segmentation via Constraint-Aware Decoding

Yeo Keat Ee, Debaditya Roy, Chen Li, Hao Zhang, Basura Fernando

PDF

1 Repo

TL;DR

This paper introduces a lightweight, constraint-based refinement method for temporal action segmentation that improves prediction accuracy by integrating structural priors into a modified Viterbi decoding process.

Contribution

It presents a novel, efficient framework that enhances TAS predictions using statistical structural priors without retraining or increasing model complexity.

Findings

01

Improves both fully and semi-supervised TAS models.

02

Corrects structural prediction errors effectively.

03

Maintains high inference efficiency.

Abstract

Temporal action segmentation (TAS) divides untrimmed videos into labeled action segments. While fully supervised methods have advanced the field, challenges such as action variability, ambiguous boundaries, and high annotation costs remain, especially in new or low-resource domains. Grammar-based approaches improve segmentation with structural priors but rely on complex parsing limiting scalability. In this work, we propose a lightweight, constraint-based refinement framework that enhances TAS predictions by integrating statistical structural priors such as transition confidence, action boundary sets, and per-class duration, that can be directly extracted from annotated data. These constraints are integrated into a modified Viterbi decoding algorithm, allowing inference-time refinement without retraining or added model complexity. Our approach improves both fully and semi-supervised TAS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LUNAProject22/CAD
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.