Action Recognition: From Static Datasets to Moving Robots

Fahimeh Rezazadegan; Sareh Shirazi; Ben Upcroft; Michael Milford

arXiv:1701.04925·cs.RO·September 20, 2017

Action Recognition: From Static Datasets to Moving Robots

Fahimeh Rezazadegan, Sareh Shirazi, Ben Upcroft, Michael Milford

PDF

TL;DR

This paper introduces a new action recognition method that effectively detects human actions in videos with camera motion and irrelevant backgrounds, outperforming existing models on multiple datasets and in safety scenarios.

Contribution

The paper presents a novel approach combining generic action region proposals with ConvNet-based shape and motion feature extraction, robust to camera motion and background distractions.

Findings

01

Achieved state-of-the-art performance on benchmark datasets.

02

Outperformed existing methods in datasets emphasizing background and camera motion.

03

Validated effectiveness in abnormal behavior detection for workplace safety.

Abstract

Deep learning models have achieved state-of-the- art performance in recognizing human activities, but often rely on utilizing background cues present in typical computer vision datasets that predominantly have a stationary camera. If these models are to be employed by autonomous robots in real world environments, they must be adapted to perform independently of background cues and camera motion effects. To address these challenges, we propose a new method that firstly generates generic action region proposals with good potential to locate one human action in unconstrained videos regardless of camera motion and then uses action proposals to extract and classify effective shape and motion features by a ConvNet framework. In a range of experiments, we demonstrate that by actively proposing action regions during both training and testing, state-of-the-art or better performance is achieved…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.