Human Activity Recognition in RGB-D Videos by Dynamic Images
Snehasis Mukherjee, Leburu Anvitha, T. Mohana Lahari

TL;DR
This paper introduces a novel approach for recognizing human activities in challenging RGB-D videos with multiple performers by using dynamic images and deep neural networks, achieving superior results on new and existing datasets.
Contribution
The paper presents a new method that captures motion information through dynamic images from RGB-D videos with multiple performers, a task not previously addressed.
Findings
Achieved comparable performance on MSR Action 3D dataset.
Outperformed state-of-the-art on a new multi-performer dataset.
Effectively extracted motion features reducing complexity.
Abstract
Human Activity Recognition in RGB-D videos has been an active research topic during the last decade. However, no efforts have been found in the literature, for recognizing human activity in RGB-D videos where several performers are performing simultaneously. In this paper we introduce such a challenging dataset with several performers performing the activities. We present a novel method for recognizing human activities in such videos. The proposed method aims in capturing the motion information of the whole video by producing a dynamic image corresponding to the input video. We use two parallel ResNext-101 to produce the dynamic images for the RGB video and depth video separately. The dynamic images contain only the motion information and hence, the unnecessary background information are eliminated. We send the two dynamic images extracted from the RGB and Depth videos respectively,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
