Learning Universal Representations from Word to Sentence

Yian Li; Hai Zhao

arXiv:2009.04656·cs.CL·September 11, 2020

Learning Universal Representations from Word to Sentence

Yian Li, Hai Zhao

PDF

Open Access

TL;DR

This paper proposes a method for learning universal linguistic representations across words, phrases, and sentences in a shared vector space, enabling more flexible language understanding and processing.

Contribution

It introduces a universal representation learning approach, constructs analogy datasets across linguistic levels, and demonstrates that pre-trained Transformer models can effectively produce such representations.

Findings

01

Pre-trained Transformer models can generate universal embeddings for multiple linguistic levels.

02

Fine-tuning ALBERT on NLI and PPDB datasets improves analogy task accuracy.

03

Universal representations enhance real-world NLP applications like FAQ understanding.

Abstract

Despite the well-developed cut-edge representation learning for language, most language representation models usually focus on specific level of linguistic unit, which cause great inconvenience when being confronted with handling multiple layers of linguistic objects in a unified way. Thus this work introduces and explores the universal representation learning, i.e., embeddings of different levels of linguistic unit in a uniform vector space through a task-independent evaluation. We present our approach of constructing analogy datasets in terms of words, phrases and sentences and experiment with multiple representation models to examine geometric properties of the learned vector space. Then we empirically verify that well pre-trained Transformer models incorporated with appropriate training settings may effectively yield universal representation. Especially, our implementation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · WordPiece · LAMB · Refunds@Expedia|||How do I get a full refund from Expedia? · ALBERT · Layer Normalization · Dropout · Dense Connections