FLEX: A Largescale Multimodal, Multiview Dataset for Learning Structured Representations for Fitness Action Quality Assessment

Hao Yin; Lijun Gu; Paritosh Parmar; Lin Xu; Tianxiao Guo; Xiujin Liu; Weiwei Fu; Yang Zhang; Tianyou Zheng

arXiv:2506.03198 · cs.CV · April 6, 2026

TL;DR

FLEX is a comprehensive multimodal dataset for fitness action quality assessment, integrating video, sEMG, and physiological data to improve AI-based fitness evaluation and coaching.

Contribution

It introduces the first large-scale multimodal, multiview fitness dataset with expert annotations and a structured knowledge graph for interpretable assessment.

Findings

01. Multimodal fusion improves AQA accuracy.
02. Multiview data enhances model robustness.
03. Fine-grained annotations support more detailed feedback.

Abstract

Action Quality Assessment (AQA) -- the task of quantifying how well an action is performed -- has great potential for detecting errors in gym weight training, where accurate feedback is critical to prevent injuries and maximize gains. Existing AQA datasets, however, are limited to single-view competitive sports and RGB video, lacking multimodal signals and professional assessment of fitness actions. We introduce FLEX, the first large-scale, multimodal, multiview dataset for fitness AQA that incorporates surface electromyography (sEMG). FLEX contains over 7,500 multiview recordings of 20 weight-loaded exercises performed by 38 subjects of diverse skill levels, with synchronized RGB video, 3D pose, sEMG, and physiological signals. Expert annotations are organized into a Fitness Knowledge Graph (FKG) linking actions, key steps, error types, and feedback, supporting a compositional scoring…
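To make the structure of the Fitness Knowledge Graph concrete, here is a minimal sketch of the action → key step → error type → feedback linkage that the abstract describes. It is not the authors' implementation: the class names, the `feedback_for` helper, and the squat example with its steps, errors, and cues are all hypothetical, chosen only to illustrate the shape of the graph. The scoring scheme is left out because the abstract is truncated before it is described.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ErrorType:
    name: str
    feedback: str  # corrective cue linked to this error type


@dataclass
class KeyStep:
    name: str
    errors: dict[str, ErrorType] = field(default_factory=dict)


@dataclass
class Action:
    name: str
    steps: dict[str, KeyStep] = field(default_factory=dict)


def build_toy_graph() -> dict[str, Action]:
    """Build a tiny graph. All names and cues are illustrative assumptions."""
    descent = KeyStep("descent", {
        "knees_cave_in": ErrorType("knees_cave_in", "Push the knees out over the toes."),
        "heels_lift": ErrorType("heels_lift", "Keep the heels planted."),
    })
    ascent = KeyStep("ascent", {
        "hips_rise_first": ErrorType("hips_rise_first", "Drive chest and hips up together."),
    })
    squat = Action("squat", {"descent": descent, "ascent": ascent})
    return {squat.name: squat}


def feedback_for(graph: dict[str, Action], action: str,
                 detected: list[tuple[str, str]]) -> list[str]:
    """Walk action -> key step -> error type and collect the linked feedback.

    `detected` is a list of (step_name, error_name) pairs, e.g. produced by
    some upstream error detector (hypothetical here).
    """
    messages = []
    for step_name, error_name in detected:
        step = graph[action].steps[step_name]
        messages.append(f"{step_name}: {step.errors[error_name].feedback}")
    return messages


if __name__ == "__main__":
    graph = build_toy_graph()
    for line in feedback_for(graph, "squat", [("descent", "heels_lift")]):
        print(line)
```

Keeping feedback attached to error types, and error types attached to key steps, means that a detected error can be traced back to the step where it occurred, which is what makes the resulting assessment interpretable.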

Code & Models

Repositories

HaoYin116/FLEX (GitHub)