Meta-Embedding as Auxiliary Task Regularization

James O' Neill; Danushka Bollegala

arXiv:1809.05886·cs.CL·January 27, 2020

Meta-Embedding as Auxiliary Task Regularization

James O' Neill, Danushka Bollegala

PDF

Open Access

TL;DR

This paper introduces a meta-embedding reconstruction as an auxiliary task to improve word and sentence embedding performance, leveraging labeled datasets for regularization and achieving significant gains in similarity and downstream tasks.

Contribution

It proposes a novel auxiliary task of reconstructing ensemble embeddings to regularize meta-embedding learning, enhancing performance across intrinsic and extrinsic evaluations.

Findings

01

Improved word similarity scores with an average increase of 11.33 in Spearman correlation.

02

Best performance in 4 out of 6 word similarity datasets using cosine and Brier's loss.

03

Enhanced downstream task performance in sequence tagging and sentence classification.

Abstract

Word embeddings have been shown to benefit from ensambling several word embedding sources, often carried out using straightforward mathematical operations over the set of word vectors. More recently, self-supervised learning has been used to find a lower-dimensional representation, similar in size to the individual word embeddings within the ensemble. However, these methods do not use the available manually labeled datasets that are often used solely for the purpose of evaluation. We propose to reconstruct an ensemble of word embeddings as an auxiliary task that regularises a main task while both tasks share the learned meta-embedding layer. We carry out intrinsic evaluation (6 word similarity datasets and 3 analogy datasets) and extrinsic evaluation (4 downstream tasks). For intrinsic task evaluation, supervision comes from various labeled word similarity datasets. Our experimental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies