Modeling Local Dependence in Natural Language with Multi-channel Recurrent Neural Networks
Chang Xu, Weiran Huang, Hongwei Wang, Gang Wang, Tie-Yan Liu

TL;DR
This paper introduces Multi-Channel RNN (MC-RNN), an advanced neural network model that dynamically captures local semantic structures in sentences, leading to improved performance across various NLP tasks.
Contribution
The paper proposes MC-RNN, a novel RNN variant with multiple channels and an attention mechanism to adaptively model local dependence patterns in natural language.
Findings
MC-RNN outperforms existing models on machine translation, summarization, and language modeling.
Significant improvements observed in experimental results across multiple NLP tasks.
Adaptive structure selection enhances the modeling of local semantic dependencies.
Abstract
Recurrent Neural Networks (RNNs) have been widely used in processing natural language tasks and achieve huge success. Traditional RNNs usually treat each token in a sentence uniformly and equally. However, this may miss the rich semantic structure information of a sentence, which is useful for understanding natural languages. Since semantic structures such as word dependence patterns are not parameterized, it is a challenge to capture and leverage structure information. In this paper, we propose an improved variant of RNN, Multi-Channel RNN (MC-RNN), to dynamically capture and leverage local semantic structure information. Concretely, MC-RNN contains multiple channels, each of which represents a local dependence pattern at a time. An attention mechanism is introduced to combine these patterns at each step, according to the semantic information. Then we parameterize structure information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Neural Networks and Applications
