Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition
Pichao Wang, Wanqing Li, Jun Wan, Philip Ogunbona, Xinwang, Liu

TL;DR
This paper introduces a cooperative deep learning framework that jointly trains a neural network on RGB and depth data for action recognition, improving discriminative features and modality robustness.
Contribution
It proposes a novel c-ConvNet that combines ranking and softmax losses to enhance feature discrimination and modality correlation in RGB-D action recognition.
Findings
Achieved state-of-the-art results on multiple RGB-D datasets.
Effectively reduces intra- and cross-modality feature variations.
Embeds modality correlations that improve recognition even with single modality.
Abstract
A novel deep neural network training paradigm that exploits the conjoint information in multiple heterogeneous sources is proposed. Specifically, in a RGB-D based action recognition task, it cooperatively trains a single convolutional neural network (named c-ConvNet) on both RGB visual features and depth features, and deeply aggregates the two kinds of features for action recognition. Differently from the conventional ConvNet that learns the deep separable features for homogeneous modality-based classification with only one softmax loss function, the c-ConvNet enhances the discriminative power of the deeply learned features and weakens the undesired modality discrepancy by jointly optimizing a ranking loss and a softmax loss for both homogeneous and heterogeneous modalities. The ranking loss consists of intra-modality and cross-modality triplet losses, and it reduces both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Hand Gesture Recognition Systems
MethodsSoftmax
