GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Zijian Chen, Wei Sun, Yuan Tian, Jun Jia, Zicheng Zhang, Jiarui Wang,, Ru Huang, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

TL;DR
This paper introduces GAIA, a large-scale dataset for assessing action quality in AI-generated videos, revealing current evaluation methods' limitations and emphasizing the need for improved AQA techniques.
Contribution
The paper constructs GAIA, a comprehensive dataset for action quality assessment in AI-generated videos, and benchmarks existing evaluation methods, highlighting their shortcomings.
Findings
Traditional AQA methods perform poorly on AIGV evaluation.
Current metrics show a significant gap compared to human perception.
GAIA enables better understanding and development of AQA models for AIGVs.
Abstract
Assessing action quality is both imperative and challenging due to its significant impact on the quality of AI-generated videos, further complicated by the inherently ambiguous nature of actions within AI-generated video (AIGV). Current action quality assessment (AQA) algorithms predominantly focus on actions from real specific scenarios and are pre-trained with normative action features, thus rendering them inapplicable in AIGVs. To address these problems, we construct GAIA, a Generic AI-generated Action dataset, by conducting a large-scale subjective evaluation from a novel causal reasoning-based perspective, resulting in 971,244 ratings among 9,180 video-action pairs. Based on GAIA, we evaluate a suite of popular text-to-video (T2V) models on their ability to generate visually rational actions, revealing their pros and cons on different categories of actions. We also extend GAIA as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI) · Human Pose and Action Recognition
MethodsFocus
