Attentive Tensor Product Learning

Qiuyuan Huang; Li Deng; Dapeng Wu; Chang Liu; Xiaodong He

arXiv:1802.07089·cs.CL·November 30, 2018

Attentive Tensor Product Learning

Qiuyuan Huang, Li Deng, Dapeng Wu, Chang Liu, Xiaodong He

PDF

Open Access

TL;DR

This paper introduces Attentive Tensor Product Learning (ATPL), a neural architecture that combines tensor product representations with attention mechanisms to explicitly model grammatical structures in language tasks.

Contribution

It presents a novel neural network architecture that integrates TPR with attention modules and deep learning models to learn grammatical structures in an unsupervised manner.

Findings

01

Effective in image captioning tasks

02

Improves POS tagging accuracy

03

Enhances constituency parsing performance

Abstract

This paper proposes a new architecture - Attentive Tensor Product Learning (ATPL) - to represent grammatical structures in deep learning models. ATPL is a new architecture to bridge this gap by exploiting Tensor Product Representations (TPR), a structured neural-symbolic model developed in cognitive science, aiming to integrate deep learning with explicit language structures and rules. The key ideas of ATPL are: 1) unsupervised learning of role-unbinding vectors of words via TPR-based deep neural network; 2) employing attention modules to compute TPR; and 3) integration of TPR with typical deep learning architectures including Long Short-Term Memory (LSTM) and Feedforward Neural Network (FFNN). The novelty of our approach lies in its ability to extract the grammatical structure of a sentence by using role-unbinding vectors, which are obtained in an unsupervised manner. This ATPL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques