Text Information Aggregation with Centrality Attention
Jingjing Gong, Hang Yan, Yining Zheng, Xipeng Qiu, Xuanjing Huang

TL;DR
This paper introduces eigen-centrality self-attention, a novel text aggregation method that models higher-order word relationships via graph eigen-centrality, improving performance in text classification and natural language inference tasks.
Contribution
It proposes a new eigen-centrality based self-attention mechanism that explicitly captures higher-order dependencies among words in text sequences.
Findings
Achieved better results than baseline models in 5 text classification tasks.
Improved performance on SNLI natural language inference task.
Developed an efficient iterative method for gradient computation of the power method.
Abstract
A lot of natural language processing problems need to encode the text sequence as a fix-length vector, which usually involves aggregation process of combining the representations of all the words, such as pooling or self-attention. However, these widely used aggregation approaches did not take higher-order relationship among the words into consideration. Hence we propose a new way of obtaining aggregation weights, called eigen-centrality self-attention. More specifically, we build a fully-connected graph for all the words in a sentence, then compute the eigen-centrality as the attention score of each word. The explicit modeling of relationships as a graph is able to capture some higher-order dependency among words, which helps us achieve better results in 5 text classification tasks and one SNLI task than baseline models such as pooling, self-attention and dynamic routing. Besides, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Complex Network Analysis Techniques · Sentiment Analysis and Opinion Mining
