Human-like Relational Models for Activity Recognition in Video

Joseph Chrol-Cannon; Andrew Gilbert; Ranko Lazic; Adithya; Madhusoodanan; Frank Guerin

arXiv:2107.05319·cs.CV·January 12, 2022

Human-like Relational Models for Activity Recognition in Video

Joseph Chrol-Cannon, Andrew Gilbert, Ranko Lazic, Adithya, Madhusoodanan, Frank Guerin

PDF

Open Access

TL;DR

This paper introduces a human-inspired method for activity recognition in videos that emphasizes understanding critical object relationships in sequential phases, leading to improved robustness over traditional neural networks.

Contribution

The paper proposes a novel approach that interprets videos in phases and extracts specific object relationships, using random forests for classification, inspired by human perception.

Findings

01

Achieved more robust performance on challenging activities

02

Outperformed neural network baselines on a subset of the something-something dataset

03

Demonstrated the importance of relational understanding in activity recognition

Abstract

Video activity recognition by deep neural networks is impressive for many classes. However, it falls short of human performance, especially for challenging to discriminate activities. Humans differentiate these complex activities by recognising critical spatio-temporal relations among explicitly recognised objects and parts, for example, an object entering the aperture of a container. Deep neural networks can struggle to learn such critical relationships effectively. Therefore we propose a more human-like approach to activity recognition, which interprets a video in sequential temporal phases and extracts specific relationships among objects and hands in those phases. Random forest classifiers are learnt from these extracted relationships. We apply the method to a challenging subset of the something-something dataset and achieve a more robust performance against neural network baselines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis