TL;DR
This paper introduces an adversarial post-specialization method that propagates lexical knowledge to enhance full vocabulary word vectors and enables zero-shot cross-lingual transfer without bilingual data.
Contribution
It presents a novel adversarial approach for full-vocabulary specialization and a zero-shot cross-lingual transfer method for word vectors.
Findings
Consistent improvements in word similarity, dialog state tracking, and lexical simplification tasks.
Effective propagation of lexical knowledge to unseen words in multiple languages.
Successful zero-shot transfer of specialization without bilingual data.
Abstract
Semantic specialization is the process of fine-tuning pre-trained distributional word vectors using external lexical knowledge (e.g., WordNet) to accentuate a particular semantic relation in the specialized vector space. While post-processing specialization methods are applicable to arbitrary distributional vectors, they are limited to updating only the vectors of words occurring in external lexicons (i.e., seen words), leaving the vectors of all other words unchanged. We propose a novel approach to specializing the full distributional vocabulary. Our adversarial post-specialization method propagates the external lexical knowledge to the full distributional space. We exploit words seen in the resources as training examples for learning a global specialization function. This function is learned by combining a standard L2-distance loss with an adversarial loss: the adversarial component…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
