Coupled Recurrent Network (CRN)
Lin Sun, Kui Jia, Yuejia Shen, Silvio Savarese, Dit Yan Yeung, and, Bertram E. Shi

TL;DR
The paper introduces a novel Coupled Recurrent Network (CRN) architecture that effectively models reciprocal information from multiple heterogeneous signals in video analysis tasks, outperforming existing methods.
Contribution
The paper proposes the CRN architecture with a Recurrent Interpretation Block (RIB) to better exploit reciprocal signals in multiple input sources for video analysis.
Findings
Achieves state-of-the-art results on human action recognition datasets.
Outperforms existing two-stream RNN architectures.
Effective in multi-person pose estimation tasks.
Abstract
Many semantic video analysis tasks can benefit from multiple, heterogenous signals. For example, in addition to the original RGB input sequences, sequences of optical flow are usually used to boost the performance of human action recognition in videos. To learn from these heterogenous input sources, existing methods reply on two-stream architectural designs that contain independent, parallel streams of Recurrent Neural Networks (RNNs). However, two-stream RNNs do not fully exploit the reciprocal information contained in the multiple signals, let alone exploit it in a recurrent manner. To this end, we propose in this paper a novel recurrent architecture, termed Coupled Recurrent Network (CRN), to deal with multiple input sources. In CRN, the parallel streams of RNNs are coupled together. Key design of CRN is a Recurrent Interpretation Block (RIB) that supports learning of reciprocal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Anomaly Detection Techniques and Applications
