Video Action Detection: Analysing Limitations and Challenges
Rajat Modi, Aayush Jung Rana, Akash Kumar, Praveen Tirupattur, Shruti, Vyas, Yogesh Singh Rawat, Mubarak Shah

TL;DR
This paper critically analyzes existing video action detection datasets, introduces a new dataset called MAMA, and investigates the importance of temporal information and biases in current methods.
Contribution
It proposes the MAMA dataset to address limitations in existing datasets and explores the role of motion and temporal ordering in action detection.
Findings
Existing datasets have biases and limitations.
Motion information may not always be necessary for action recognition.
Temporal ordering can be less critical than assumed.
Abstract
Beyond possessing large enough size to feed data hungry machines (eg, transformers), what attributes measure the quality of a dataset? Assuming that the definitions of such attributes do exist, how do we quantify among their relative existences? Our work attempts to explore these questions for video action detection. The task aims to spatio-temporally localize an actor and assign a relevant action class. We first analyze the existing datasets on video action detection and discuss their limitations. Next, we propose a new dataset, Multi Actor Multi Action (MAMA) which overcomes these limitations and is more suitable for real world applications. In addition, we perform a biasness study which analyzes a key property differentiating videos from static images: the temporal aspect. This reveals if the actions in these datasets really need the motion information of an actor, or whether they…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications
