Pose-Aware Multi-Level Motion Parsing for Action Quality Assessment
Shuaikang Zhu, Yang Yang, Chen Sun

TL;DR
This paper introduces a multi-level motion parsing framework leveraging enhanced pose features for precise action segmentation and scoring in action quality assessment, demonstrating state-of-the-art results on diving datasets.
Contribution
The paper presents a novel multi-level motion parsing framework with an Action-Unit Parser, Motion Parser, Condition Parser, and Weight-Adjust Scoring Module for improved AQA performance.
Findings
Achieves state-of-the-art action segmentation accuracy.
Outperforms existing methods in action scoring tasks.
Effectively incorporates special conditions like water splash.
Abstract
Human pose serves as a cornerstone of action quality assessment (AQA), where subtle spatial-temporal variations in pose often distinguish excellence from mediocrity. In high-level competitions, these nuanced differences become decisive factors in scoring. In this paper, we propose a novel multi-level motion parsing framework for AQA based on enhanced spatial-temporal pose features. On the first level, the Action-Unit Parser is designed with the help of pose extraction to achieve precise action segmentation and comprehensive local-global pose representations. On the second level, Motion Parser is used by spatial-temporal feature learning to capture pose changes and appearance details for each action-unit. Meanwhile, some special conditions other than body-related will impact action scoring, like water splash in diving. In this work, we design an additional Condition Parser to offer users…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Human Motion and Animation
