Action Quality Assessment via Hierarchical Pose-guided Multi-stage Contrastive Regression
Mengshi Qi, Hao Ye, Jiaxuan Peng, Huadong Ma

TL;DR
This paper introduces a hierarchical pose-guided multi-stage contrastive regression method for action quality assessment, effectively capturing fine-grained features and sub-action segmentation to improve performance on diving and athletic datasets.
Contribution
The paper proposes a novel multi-stage contrastive regression framework with a multi-scale visual-skeleton encoder and sub-action segmentation, advancing AQA accuracy.
Findings
Outperforms existing methods on FineDiving and MTL-AQA datasets.
Effectively captures fine-grained pose differences and sub-action variations.
Demonstrates the benefit of hierarchical pose guidance and contrastive learning.
Abstract
Action Quality Assessment (AQA), which aims at automatic and fair evaluation of athletic performance, has gained increasing attention in recent years. However, athletes are often in rapid movement and the corresponding visual appearance variances are subtle, making it challenging to capture fine-grained pose differences and leading to poor estimation performance. Furthermore, most common AQA tasks, such as diving in sports, are usually divided into multiple sub-actions, each of which contains different durations. However, existing methods focus on segmenting the video into fixed frames, which disrupts the temporal continuity of sub-actions resulting in unavoidable prediction errors. To address these challenges, we propose a novel action quality assessment method through hierarchically pose-guided multi-stage contrastive regression. Firstly, we introduce a multi-scale dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Anomaly Detection Techniques and Applications
MethodsSoftmax · Attention Is All You Need · Contrastive Learning · Focus
