Multi-Camera Action Dataset for Cross-Camera Action Recognition   Benchmarking

Wenhui Li; Yongkang Wong; An-An Liu; Yang Li; Yu-Ting Su; Mohan; Kankanhalli

arXiv:1607.06408·cs.CV·May 8, 2017

Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking

Wenhui Li, Yongkang Wong, An-An Liu, Yang Li, Yu-Ting Su, Mohan, Kankanhalli

PDF

TL;DR

This paper introduces the Multi-Camera Action Dataset (MCAD) for cross-camera action recognition, highlighting the challenges of view variation and providing a benchmark that reveals significant performance drops in cross-view scenarios.

Contribution

The paper presents a new large-scale dataset and evaluation protocol for cross-camera action recognition in surveillance environments, addressing limitations of existing datasets.

Findings

01

High accuracy (85%) in closed-view scenarios

02

Significant performance drop (from 87.0% to 47.4%) in cross-view scenarios

03

Benchmark results highlight challenges in open view action recognition

Abstract

Action recognition has received increasing attention from the computer vision and machine learning communities in the last decade. To enable the study of this problem, there exist a vast number of action datasets, which are recorded under controlled laboratory settings, real-world surveillance environments, or crawled from the Internet. Apart from the "in-the-wild" datasets, the training and test split of conventional datasets often possess similar environments conditions, which leads to close to perfect performance on constrained datasets. In this paper, we introduce a new dataset, namely Multi-Camera Action Dataset (MCAD), which is designed to evaluate the open view classification problem under the surveillance environment. In total, MCAD contains 14,298 action samples from 18 action categories, which are performed by 20 subjects and independently recorded with 5 cameras. Inspired by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.