Video Action Recognition Using spatio-temporal optical flow video frames

Aytekin Nebisoy; Saber Malekzadeh

arXiv:2103.05101·cs.CV·March 10, 2021

Video Action Recognition Using spatio-temporal optical flow video frames

Aytekin Nebisoy, Saber Malekzadeh

PDF

TL;DR

This paper presents a deep neural network approach for human action recognition in videos, utilizing RGB and optical flow data to achieve high accuracy despite challenges like cluttered backgrounds and camera movement.

Contribution

It introduces a deep learning model that combines spatial and temporal features from RGB and optical flow videos for improved action classification.

Findings

01

Achieved approximately 94% recognition accuracy.

02

Effectively handled background clutter and viewpoint variations.

03

Utilized combined RGB and optical flow inputs for better performance.

Abstract

Recognizing human actions based on videos has became one of the most popular areas of research in computer vision in recent years. This area has many applications such as surveillance, robotics, health care, video search and human-computer interaction. There are many problems associated with recognizing human actions in videos such as cluttered backgrounds, obstructions, viewpoints variation, execution speed and camera movement. A large number of methods have been proposed to solve the problems. This paper focus on spatial and temporal pattern recognition for the classification of videos using Deep Neural Networks. This model takes RGB images and Optical Flow as input data and outputs an action class number. The final recognition accuracy was about 94%.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.