# Enhancing gesture recognition with multiscale feature extraction and spatial attention

**Authors:** Jingpeng Lei

PMC · DOI: 10.1371/journal.pone.0324050 · 2025-06-09

## TL;DR

This paper presents a new gesture recognition method using multiscale features and spatial attention to improve accuracy and robustness.

## Contribution

A novel approach combining multiscale feature extraction and spatial attention for enhanced gesture recognition.

## Key findings

- The proposed method outperforms existing techniques on benchmark datasets in terms of accuracy.
- Multiscale feature extraction provides a more holistic representation of gestures.
- Spatial attention improves focus on relevant image regions for better discrimination.

## Abstract

Gesture recognition technology is a pivotal element in human-computer interaction, enabling users to communicate with machines in a natural and intuitive manner. This paper introduces a novel approach to gesture recognition that enhances accuracy and robustness by integrating multiscale feature extraction and spatial attention mechanisms. Specifically, we have developed a multiscale feature extraction module inspired by the Inception architecture, which captures comprehensive features across various scales, providing a more holistic feature representation. Additionally, We incorporate a spatial attention mechanism that focuses on image regions most relevant to the current gesture, thereby improving the discriminative power of the features. Extensive experiments conducted on multiple benchmark datasets demonstrate that our method significantly outperforms existing gesture recognition techniques in terms of accuracy.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

47 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12148073/full.md

---
Source: https://tomesphere.com/paper/PMC12148073