Real-time Action Recognition with Dissimilarity-based Training of Specialized Module Networks
Marian K.Y. Boktor, Ahmad Al-Kabbany, Radwa Khalil, Said, El-Khamy

TL;DR
This paper introduces a novel approach for real-time action recognition by training multiple specialized networks on subsets of classes, achieving high accuracy with efficient computation on standard datasets.
Contribution
It proposes a dissimilarity-based class distribution method for training specialized module networks, improving efficiency and potentially enabling lightweight architectures.
Findings
Achieved 72.5% accuracy on HMDB-51 dataset.
Demonstrated comparable or superior performance to state-of-the-art methods.
Satisfies real-time processing constraints.
Abstract
This paper addresses the problem of real-time action recognition in trimmed videos, for which deep neural networks have defined the state-of-the-art performance in the recent literature. For attaining higher recognition accuracies with efficient computations, researchers have addressed the various aspects of limitations in the recognition pipeline. This includes network architecture, the number of input streams (where additional streams augment the color information), the cost function to be optimized, in addition to others. The literature has always aimed, though, at assigning the adopted network (or networks, in case of multiple streams) the task of recognizing the whole number of action classes in the dataset at hand. We propose to train multiple specialized module networks instead. Each module is trained to recognize a subset of the action classes. Towards this goal, we present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis
