Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei, Xin Feng, Canmiao Fu, Qiong Cao, Guangming Lu, Yu-Wing, Tai

TL;DR
This paper introduces Non-local Recurrent Neural Memory (NRNM), a novel sequence learning model that captures high-order, long-range dependencies using self-attention and gated recurrent mechanisms, outperforming existing methods across various sequence tasks.
Contribution
The paper proposes NRNM, a new model that effectively models high-order and long-range dependencies in sequences through non-local operations and global interactions.
Findings
NRNM outperforms state-of-the-art methods in sequence classification.
NRNM effectively captures long-range dependencies.
NRNM generalizes well across different sequence modalities.
Abstract
The key challenge of sequence representation learning is to capture the long-range temporal dependencies. Typical methods for supervised sequence representation learning are built upon recurrent neural networks to capture temporal dependencies. One potential limitation of these methods is that they only model one-order information interactions explicitly between adjacent time steps in a sequence, hence the high-order interactions between nonadjacent time steps are not fully exploited. It greatly limits the capability of modeling the long-range temporal dependencies since the temporal features learned by one-order interactions cannot be maintained for a long term due to temporal information dilution and gradient vanishing. To tackle this limitation, we propose the Non-local Recurrent Neural Memory (NRNM) for supervised sequence representation learning, which performs non-local operations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Music and Audio Processing · Neural Networks and Applications
