Activity Recognition with Moving Cameras and Few Training Examples:   Applications for Detection of Autism-Related Headbanging

Peter Washington; Aaron Kline; Onur Cezmi Mutlu; Emilie Leblanc; Cathy; Hou; Nate Stockham; Kelley Paskov; Brianna Chrisman; Dennis P. Wall

arXiv:2101.03478·cs.CV·January 12, 2021

Activity Recognition with Moving Cameras and Few Training Examples: Applications for Detection of Autism-Related Headbanging

Peter Washington, Aaron Kline, Onur Cezmi Mutlu, Emilie Leblanc, Cathy, Hou, Nate Stockham, Kelley Paskov, Brianna Chrisman, Dennis P. Wall

PDF

TL;DR

This paper presents a novel approach for detecting autism-related headbanging in videos captured with moving cameras, using a head pose keypoint-based CNN-LSTM classifier that performs well with limited training data.

Contribution

It introduces a head pose keypoint feature representation and a CNN-LSTM model for activity recognition in shaky, limited-data videos, specifically targeting autism-related behaviors.

Findings

01

Achieved a mean F1-score of 90.77% on headbanging detection.

02

Demonstrated effectiveness with few training examples and unstable camera footage.

03

Validated the approach on the Self Stimulatory Behaviour Dataset (SSBD).

Abstract

Activity recognition computer vision algorithms can be used to detect the presence of autism-related behaviors, including what are termed "restricted and repetitive behaviors", or stimming, by diagnostic instruments. The limited data that exist in this domain are usually recorded with a handheld camera which can be shaky or even moving, posing a challenge for traditional feature representation approaches for activity detection which mistakenly capture the camera's motion as a feature. To address these issues, we first document the advantages and limitations of current feature representation techniques for activity recognition when applied to head banging detection. We then propose a feature representation consisting exclusively of head pose keypoints. We create a computer vision classifier for detecting head banging in home videos using a time-distributed convolutional neural network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.