Attentive Tensor Product Learning
Qiuyuan Huang, Li Deng, Dapeng Wu, Chang Liu, Xiaodong He

TL;DR
This paper introduces Attentive Tensor Product Learning (ATPL), a neural architecture that combines tensor product representations with attention mechanisms to explicitly model grammatical structures in language tasks.
Contribution
It presents a novel neural network architecture that integrates TPR with attention modules and deep learning models to learn grammatical structures in an unsupervised manner.
Findings
Effective in image captioning tasks
Improves POS tagging accuracy
Enhances constituency parsing performance
Abstract
This paper proposes a new architecture - Attentive Tensor Product Learning (ATPL) - to represent grammatical structures in deep learning models. ATPL is a new architecture to bridge this gap by exploiting Tensor Product Representations (TPR), a structured neural-symbolic model developed in cognitive science, aiming to integrate deep learning with explicit language structures and rules. The key ideas of ATPL are: 1) unsupervised learning of role-unbinding vectors of words via TPR-based deep neural network; 2) employing attention modules to compute TPR; and 3) integration of TPR with typical deep learning architectures including Long Short-Term Memory (LSTM) and Feedforward Neural Network (FFNN). The novelty of our approach lies in its ability to extract the grammatical structure of a sentence by using role-unbinding vectors, which are obtained in an unsupervised manner. This ATPL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
