Jointly Learning Structured Representations and Stabilized Affinity for Human Motion Segmentation

Xianghan Meng; Zhiyuan Huang; Zhengyu Tong; Chun-Guang Li

arXiv:2605.05753·cs.CV·May 8, 2026

Jointly Learning Structured Representations and Stabilized Affinity for Human Motion Segmentation

Xianghan Meng, Zhiyuan Huang, Zhengyu Tong, Chun-Guang Li

PDF

TL;DR

This paper introduces TDSC, a novel method for human motion segmentation that jointly learns structured representations and stabilized affinity to improve accuracy and robustness, especially when raw features violate traditional assumptions.

Contribution

The proposed TDSC approach innovatively combines self-expressive modeling with temporal constraints and a stabilization mechanism for affinity, advancing HMS performance.

Findings

01

TDSC outperforms existing methods on five benchmark datasets.

02

It effectively handles raw features that violate traditional subspace assumptions.

03

The approach is validated with both conventional and deep features.

Abstract

Human Motion Segmentation (HMS), which aims to partition a video into non-overlapping segments corresponding to different human motions, has recently attracted increasing research attention. Existing HMS approaches are predominantly based on subspace clustering, which are grounded on the assumption that the distribution of high-dimensional temporal features well aligns with a Union-of-Subspaces (UoS). For videos in the real world, however, the raw frame-level features often violate the UoS assumption and yield unsatisfactory segmentation performance. To address this issue, we propose an efficient and effective approach for HMS, named Temporal Deep Self-expressive subspace Clustering (TDSC), which jointly learns temporally consistent structured representations and stabilized affinity for accurate and robust HMS. Specifically, in TDSC, we alternately learn structured representations of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.