Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word   Vector Specialization

Edoardo Maria Ponti; Ivan Vuli\'c; Goran Glava\v{s}; Nikola; Mrk\v{s}i\'c; Anna Korhonen

arXiv:1809.04163·cs.CL·November 2, 2018

Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization

Edoardo Maria Ponti, Ivan Vuli\'c, Goran Glava\v{s}, Nikola, Mrk\v{s}i\'c, Anna Korhonen

PDF

1 Repo

TL;DR

This paper introduces an adversarial post-specialization method that propagates lexical knowledge to enhance full vocabulary word vectors and enables zero-shot cross-lingual transfer without bilingual data.

Contribution

It presents a novel adversarial approach for full-vocabulary specialization and a zero-shot cross-lingual transfer method for word vectors.

Findings

01

Consistent improvements in word similarity, dialog state tracking, and lexical simplification tasks.

02

Effective propagation of lexical knowledge to unseen words in multiple languages.

03

Successful zero-shot transfer of specialization without bilingual data.

Abstract

Semantic specialization is the process of fine-tuning pre-trained distributional word vectors using external lexical knowledge (e.g., WordNet) to accentuate a particular semantic relation in the specialized vector space. While post-processing specialization methods are applicable to arbitrary distributional vectors, they are limited to updating only the vectors of words occurring in external lexicons (i.e., seen words), leaving the vectors of all other words unchanged. We propose a novel approach to specializing the full distributional vocabulary. Our adversarial post-specialization method propagates the external lexical knowledge to the full distributional space. We exploit words seen in the resources as training examples for learning a global specialization function. This function is learned by combining a standard L2-distance loss with an adversarial loss: the adversarial component…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cambridgeltl/adversarial-postspec
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.