A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors

Mikhail Khodak; Nikunj Saunshi; Yingyu Liang; Tengyu Ma; Brandon Stewart; Sanjeev Arora

arXiv:1805.05388 · cs.CL · May 16, 2018

TL;DR

This paper presents a simple, efficient method called a la carte embedding for inducing semantic feature vectors for rare or unseen words and features, leveraging pretrained embeddings and linear regression.

Contribution

The paper introduces a linear transformation, learned by linear regression on pretrained word vectors, that induces embeddings for new textual features on the fly from minimal data, in some cases a single usage example. It reports state-of-the-art results on a nonce task.
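
As a rough illustration of the linear-transformation idea, the regression can be written as follows. The notation here is an assumption introduced for this sketch, not taken from this page: $v_w$ is the pretrained vector of word $w$, $C_w$ is the set of contexts in which $w$ occurs, $u_w$ is the average of the pretrained vectors of the words in those contexts, and $f$ is a new feature.

```latex
% u_w: average of the pretrained vectors of the words surrounding w
u_w = \frac{1}{|C_w|} \sum_{c \in C_w} \frac{1}{|c|} \sum_{w' \in c} v_{w'}

% Learn the transform by linear regression over words with known vectors
\hat{A} = \arg\min_{A} \sum_{w} \left\lVert v_w - A\, u_w \right\rVert_2^2

% Induce a vector for a new feature f from its (possibly single) context
v_f = \hat{A}\, u_f
```

Once $\hat{A}$ is fitted, inducing a vector for a new feature costs one context average and one matrix-vector product.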

Findings

01. Requires fewer examples to learn high-quality embeddings
02. Achieves state-of-the-art results on nonce tasks
03. Effective for unsupervised document classification

Abstract

Motivations like domain adaptation, transfer learning, and feature learning have fueled interest in inducing embeddings for rare or unseen words, n-grams, synsets, and other textual features. This paper introduces a la carte embedding, a simple and general alternative to the usual word2vec-based approaches for building such representations that is based upon recent theoretical results for GloVe-like embeddings. Our method relies mainly on a linear transformation that is efficiently learnable using pretrained word vectors and linear regression. This transform is applicable on the fly in the future when a new text feature or rare word is encountered, even if only a single usage example is available. We introduce a new dataset showing how the a la carte method requires fewer examples of words in context to learn high-quality embeddings and we obtain state-of-the-art results on a nonce task…
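
The transform described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation (see the repository listed below for that): the helper names `context_average`, `learn_transform`, and `induce` are hypothetical, the window size is arbitrary, and the demo uses synthetic random vectors in place of real pretrained embeddings.

```python
import numpy as np

def context_average(tokens, index, vectors, window=2):
    """Average the pretrained vectors of the words around position `index`.

    `vectors` is a dict mapping word -> 1-D array (hypothetical format).
    Words without a pretrained vector are skipped.
    """
    lo = max(0, index - window)
    hi = min(len(tokens), index + window + 1)
    ctx = [vectors[t] for i, t in enumerate(tokens[lo:hi], start=lo)
           if i != index and t in vectors]
    dim = len(next(iter(vectors.values())))
    return np.mean(ctx, axis=0) if ctx else np.zeros(dim)

def learn_transform(word_vecs, ctx_vecs):
    """Fit A by least squares so that word_vecs[i] ~= A @ ctx_vecs[i].

    Both inputs have shape (n_words, dim). lstsq solves
    ctx_vecs @ X = word_vecs, and A is the transpose of X.
    """
    X, *_ = np.linalg.lstsq(ctx_vecs, word_vecs, rcond=None)
    return X.T

def induce(A, ctx_vec):
    """Embed a new word or feature from its average context vector."""
    return A @ ctx_vec

# Synthetic demo: recover a known transform from noise-free data.
rng = np.random.default_rng(0)
d, n = 5, 200
A_true = rng.normal(size=(d, d))
U = rng.normal(size=(n, d))      # stand-in for average context vectors
V = U @ A_true.T                 # stand-in for pretrained word vectors
A = learn_transform(V, U)

new_ctx = rng.normal(size=d)     # context average of an unseen word
v_new = induce(A, new_ctx)
```

With real data, `U` would be built by calling `context_average` over every occurrence of each vocabulary word and averaging the results, and `V` would hold the corresponding pretrained vectors. A single usage example of a new word is then enough to compute `new_ctx` and apply `induce`.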

Code & Models

Repositories

NLPrinceton/ALaCarte
Official