Learning Efficient Task-Specific Meta-Embeddings with Word Prisms
Jingyi He, KC Tsiolis, Kian Kenyon-Dean, Jackie Chi Kit Cheung

TL;DR
This paper introduces word prisms, a simple and efficient meta-embedding method that learns to combine multiple word embeddings for improved task-specific NLP performance.
Contribution
The paper proposes word prisms, a novel meta-embedding approach using orthogonal transformations for efficient, task-adaptive combination of source embeddings.
Findings
Word prisms outperform other meta-embedding methods on six extrinsic tasks.
Word prisms are computationally efficient at inference time.
Meta-embeddings enhance NLP task performance by leveraging multiple embedding sources.
Abstract
Word embeddings are trained to predict word cooccurrence statistics, which leads them to possess different lexical properties (syntactic, semantic, etc.) depending on the notion of context defined at training time. These properties manifest when querying the embedding space for the most similar vectors, and when used at the input layer of deep neural networks trained to solve downstream NLP problems. Meta-embeddings combine multiple sets of differently trained word embeddings, and have been shown to successfully improve intrinsic and extrinsic performance over equivalent models which use just one set of source embeddings. We introduce word prisms: a simple and efficient meta-embedding method that learns to combine source embeddings according to the task at hand. Word prisms learn orthogonal transformations to linearly combine the input source embeddings, which allows them to be very…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
