KPM-Bench: A Kinematic Parsing Motion Benchmark for Fine-grained Motion-centric Video Understanding
Boda Lin, Yongjie Zhu, Xiaocheng Gong, Wenyu Qin, Meng Wang

TL;DR
This paper introduces KPM-Bench, a new dataset and evaluation framework for fine-grained motion understanding in videos, along with the MoPE algorithm to reduce hallucination in motion descriptions.
Contribution
It presents a novel benchmark dataset with detailed annotations and a new algorithm to improve the accuracy and reliability of motion-centric video captioning models.
Findings
KPM-Bench enables detailed limb-level motion analysis.
MoPE reduces hallucination in motion descriptions.
Enhanced captioning accuracy for complex human motions.
Abstract
Despite recent advancements, video captioning models still face significant limitations in accurately describing fine-grained motion details and suffer from severe hallucination issues. These challenges become particularly prominent when generating captions for motion-centric videos, where precise depiction of intricate movements and limb dynamics is crucial yet often neglected. To alleviate this gap, we introduce an automated annotation pipeline that integrates kinematic-based motion computation with linguistic parsing, enabling detailed decomposition and description of complex human motions. Based on this pipeline, we construct and release the Kinematic Parsing Motion Benchmark (KPM-Bench), a novel open-source dataset designed to facilitate fine-grained motion understanding. KPM-Bench consists of (i) fine-grained video-caption pairs that comprehensively illustrate limb-level dynamics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Action Observation and Synchronization
