Towards Open-World Human Action Segmentation Using Graph Convolutional Networks
Hao Xing, Kai Zhe Boey, Gordon Cheng

TL;DR
This paper introduces a novel framework for open-world human action segmentation using graph convolutional networks, capable of detecting and segmenting unseen actions without manual annotations, advancing activity understanding in dynamic environments.
Contribution
The paper proposes an innovative structured framework with a new GCN decoder, Mixup training for out-of-distribution data synthesis, and a Temporal Clustering loss for improved open-world action segmentation.
Findings
Significant improvements in open-set segmentation metrics (F1@50)
Enhanced out-of-distribution detection performance (AUROC)
Effective ablation of proposed components
Abstract
Human-object interaction segmentation is a fundamental task of daily activity understanding, which plays a crucial role in applications such as assistive robotics, healthcare, and autonomous systems. Most existing learning-based methods excel in closed-world action segmentation, they struggle to generalize to open-world scenarios where novel actions emerge. Collecting exhaustive action categories for training is impractical due to the dynamic diversity of human activities, necessitating models that detect and segment out-of-distribution actions without manual annotation. To address this issue, we formally define the open-world action segmentation problem and propose a structured framework for detecting and segmenting unseen actions. Our framework introduces three key innovations: 1) an Enhanced Pyramid Graph Convolutional Network (EPGCN) with a novel decoder module for robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Advanced Neural Network Applications
