Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking
Wenhui Li, Yongkang Wong, An-An Liu, Yang Li, Yu-Ting Su, Mohan, Kankanhalli

TL;DR
This paper introduces the Multi-Camera Action Dataset (MCAD) for cross-camera action recognition, highlighting the challenges of view variation and providing a benchmark that reveals significant performance drops in cross-view scenarios.
Contribution
The paper presents a new large-scale dataset and evaluation protocol for cross-camera action recognition in surveillance environments, addressing limitations of existing datasets.
Findings
High accuracy (85%) in closed-view scenarios
Significant performance drop (from 87.0% to 47.4%) in cross-view scenarios
Benchmark results highlight challenges in open view action recognition
Abstract
Action recognition has received increasing attention from the computer vision and machine learning communities in the last decade. To enable the study of this problem, there exist a vast number of action datasets, which are recorded under controlled laboratory settings, real-world surveillance environments, or crawled from the Internet. Apart from the "in-the-wild" datasets, the training and test split of conventional datasets often possess similar environments conditions, which leads to close to perfect performance on constrained datasets. In this paper, we introduce a new dataset, namely Multi-Camera Action Dataset (MCAD), which is designed to evaluate the open view classification problem under the surveillance environment. In total, MCAD contains 14,298 action samples from 18 action categories, which are performed by 20 subjects and independently recorded with 5 cameras. Inspired by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
