GLoMo: Unsupervisedly Learned Relational Graphs as Transferable   Representations

Zhilin Yang; Jake Zhao; Bhuwan Dhingra; Kaiming He; William W. Cohen,; Ruslan Salakhutdinov; Yann LeCun

arXiv:1806.05662·cs.LG·July 4, 2018·30 cites

GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations

Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen,, Ruslan Salakhutdinov, Yann LeCun

PDF

Open Access 1 Repo

TL;DR

This paper introduces GLoMo, a framework for learning and transferring latent relational graphs from large-scale unlabeled data, enhancing performance across diverse NLP and vision tasks beyond traditional feature transfer methods.

Contribution

It proposes a novel method for unsupervised learning of relational graphs that are transferable across different data modalities and embedding types, extending transfer learning capabilities.

Findings

01

Improved performance on question answering, natural language inference, sentiment analysis, and image classification.

02

Relational graphs are transferable across different embeddings and even embedding-free data.

03

The learned graphs generalize well to various downstream tasks.

Abstract

Modern deep transfer learning approaches have mainly focused on learning generic feature vectors from one task that are transferable to other tasks, such as word embeddings in language and pretrained convolutional features in vision. However, these approaches usually transfer unary features and largely ignore more structured graphical representations. This work explores the possibility of learning generic latent relational graphs that capture dependencies between pairs of data units (e.g., words or pixels) from large-scale unlabeled data and transferring the graphs to downstream tasks. Our proposed transfer learning framework improves performance on various tasks including question answering, natural language inference, sentiment analysis, and image classification. We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YJHMITWEB/GLoMo-tensorflow
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Graph Theory and Algorithms

MethodsSigmoid Activation · Tanh Activation · GloVe Embeddings · Long Short-Term Memory · Bidirectional LSTM · Softmax · ELMo