Information Aggregation via Dynamic Routing for Sequence Encoding
Jingjing Gong, Xipeng Qiu, Shaojing Wang, Xuanjing Huang

TL;DR
This paper introduces a dynamic routing-based aggregation mechanism for sequence encoding that adaptively transfers information from word vectors to a fixed-size representation, improving performance on text classification tasks.
Contribution
It proposes a novel dynamic routing aggregation method inspired by Capsule Networks, enhancing sequence encoding by task-guided information transfer.
Findings
Outperforms traditional pooling methods on five text classification tasks
Refines message passing based on the final encoding state
Demonstrates significant performance improvements
Abstract
While much progress has been made in how to encode a text sequence into a sequence of vectors, less attention has been paid to how to aggregate these preceding vectors (outputs of RNN/CNN) into fixed-size encoding vector. Usually, a simple max or average pooling is used, which is a bottom-up and passive way of aggregation and lack of guidance by task information. In this paper, we propose an aggregation mechanism to obtain a fixed-size encoding with a dynamic routing policy. The dynamic routing policy is dynamically deciding that what and how much information need be transferred from each word to the final encoding of the text sequence. Following the work of Capsule Network, we design two dynamic routing policies to aggregate the outputs of RNN/CNN encoding layer into a final encoding vector. Compared to the other aggregation methods, dynamic routing can refine the messages according to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Malware Detection Techniques · Machine Learning and Data Classification
MethodsCapsule Network · Average Pooling
