Exploring Sub-skeleton Trajectories for Interpretable Recognition of   Sign Language

Joachim Gudmundsson; Martin P. Seybold; John Pfeifer

arXiv:2202.01390·cs.CV·February 4, 2022

Exploring Sub-skeleton Trajectories for Interpretable Recognition of Sign Language

Joachim Gudmundsson, Martin P. Seybold, John Pfeifer

PDF

Open Access

TL;DR

This paper introduces a simple, scalable method using sub-skeleton trajectories and natural distance measures for interpretable sign language recognition, outperforming recent state-of-the-art approaches across various datasets.

Contribution

The paper proposes a novel geometric feature space called sub-skeletons and a natural distance measure for sign language recognition, demonstrating improved accuracy and interpretability.

Findings

01

Simple methods outperform recent state-of-the-art approaches.

02

Effective across different data domains and tracking technologies.

03

Boosted variation further improves recognition accuracy.

Abstract

Recent advances in tracking sensors and pose estimation software enable smart systems to use trajectories of skeleton joint locations for supervised learning. We study the problem of accurately recognizing sign language words, which is key to narrowing the communication gap between hard and non-hard of hearing people. Our method explores a geometric feature space that we call `sub-skeleton' aspects of movement. We assess similarity of feature space trajectories using natural, speed invariant distance measures, which enables clear and insightful nearest neighbor classification. The simplicity and scalability of our basic method allows for immediate application in different data domains with little to no parameter tuning. We demonstrate the effectiveness of our basic method, and a boosted variation, with experiments on data from different application domains and tracking technologies.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Gait Recognition and Analysis

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings