Human-like Relational Models for Activity Recognition in Video
Joseph Chrol-Cannon, Andrew Gilbert, Ranko Lazic, Adithya, Madhusoodanan, Frank Guerin

TL;DR
This paper introduces a human-inspired method for activity recognition in videos that emphasizes understanding critical object relationships in sequential phases, leading to improved robustness over traditional neural networks.
Contribution
The paper proposes a novel approach that interprets videos in phases and extracts specific object relationships, using random forests for classification, inspired by human perception.
Findings
Achieved more robust performance on challenging activities
Outperformed neural network baselines on a subset of the something-something dataset
Demonstrated the importance of relational understanding in activity recognition
Abstract
Video activity recognition by deep neural networks is impressive for many classes. However, it falls short of human performance, especially for challenging to discriminate activities. Humans differentiate these complex activities by recognising critical spatio-temporal relations among explicitly recognised objects and parts, for example, an object entering the aperture of a container. Deep neural networks can struggle to learn such critical relationships effectively. Therefore we propose a more human-like approach to activity recognition, which interprets a video in sequential temporal phases and extracts specific relationships among objects and hands in those phases. Random forest classifiers are learnt from these extracted relationships. We apply the method to a challenging subset of the something-something dataset and achieve a more robust performance against neural network baselines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis
