Towards Robust Human Activity Recognition from RGB Video Stream with   Limited Labeled Data

Krishanu Sarker; Mohamed Masoud; Saeid Belkasim; Shihao Ji

arXiv:1812.06544·cs.CV·December 18, 2018·1 cites

Towards Robust Human Activity Recognition from RGB Video Stream with Limited Labeled Data

Krishanu Sarker, Mohamed Masoud, Saeid Belkasim, Shihao Ji

PDF

Open Access

TL;DR

This paper introduces a novel RGB-only human activity recognition framework using skeleton data and deep BLSTM, overcoming limited labeled data challenges and outperforming RGB-D based methods.

Contribution

The paper presents a new RGB-only activity recognition approach with innovative training techniques that surpass state-of-the-art RGB-D methods.

Findings

01

RGB-only approach outperforms RGB-D methods

02

Effective training with limited labeled data

03

Proposed techniques improve model robustness

Abstract

Human activity recognition based on video streams has received numerous attentions in recent years. Due to lack of depth information, RGB video based activity recognition performs poorly compared to RGB-D video based solutions. On the other hand, acquiring depth information, inertia etc. is costly and requires special equipment, whereas RGB video streams are available in ordinary cameras. Hence, our goal is to investigate whether similar or even higher accuracy can be achieved with RGB-only modality. In this regard, we propose a novel framework that couples skeleton data extracted from RGB video and deep Bidirectional Long Short Term Memory (BLSTM) model for activity recognition. A big challenge of training such a deep network is the limited training data, and exploring RGB-only stream significantly exaggerates the difficulty. We therefore propose a set of algorithmic techniques to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods

MethodsDropout