Visual intelligence for efficient human action recognition in human computers interaction applications

Noorah Alghasham; Waleed Albattah

PMC · DOI:10.1371/journal.pone.0343132·March 5, 2026

Visual intelligence for efficient human action recognition in human computers interaction applications

Noorah Alghasham, Waleed Albattah

PDF

Open Access

TL;DR

This paper introduces a deep learning model combining CNNs and RNNs for efficient and accurate human action recognition in human-computer interaction.

Contribution

A novel HAR model using EfficientNetB7 and LSTM for high accuracy and low computational cost without data augmentation.

Findings

01

The model achieved 97.8% accuracy on the UCF101 dataset.

02

It outperformed existing models on the HMDB51 dataset with 80.1% accuracy.

03

The model reduces computational complexity and avoids the need for data augmentation.

Abstract

Human Action Recognition (HAR) is a pivotal area in computer vision, video surveillance, and human-computer interaction (HCI), driven by the need for efficient and accurate models to enhance HCI experiences. Traditional HAR methods often rely on hand-crafted features and shallow learning techniques, which limits their ability to capture complex patterns. In contrast, this study proposes an efficient HAR model that leverages deep neural networks, specifically a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs), to enhance HCI through AI-powered action understanding. The model employs a pre-trained EfficientNetB7 network to extract rich spatial features from video frames, followed by a Long Short-Term Memory (LSTM) network to capture long-range temporal dependencies. This architecture enhances recognition accuracy while reducing computational…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Cell lines2

LSTM— Homo sapiens (Human) · Transformed cell line UCF101— Mus musculus (Mouse) · Hybridoma

Chemicals1

RNN

Diseases1

HAR

Figures19

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Multimodal Machine Learning Applications