Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources

Ivan Vulić; Goran Glavaš; Nikola Mrkšić; Anna Korhonen

arXiv:1805.03228 · cs.CL · May 10, 2018

TL;DR

This paper introduces a novel post-specialisation method that propagates external lexical knowledge to unseen words in word vector spaces using a deep neural network, improving their representations for various NLP tasks across multiple languages.

Contribution

The paper extends word vector specialisation to words unseen in lexical resources by learning a non-linear transformation, so that the whole vocabulary of a distributional space benefits, not only the words covered by the lexicon.

Findings

01. Significant improvements in word similarity tasks

02. Enhanced performance in dialogue state tracking

03. Better results in lexical text simplification

Abstract

Word vector specialisation (also known as retrofitting) is a portable, light-weight approach to fine-tuning arbitrary distributional word vector spaces by injecting external knowledge from rich lexical resources such as WordNet. By design, these post-processing methods only update the vectors of words occurring in external lexicons, leaving the representations of all unseen words intact. In this paper, we show that constraint-driven vector space specialisation can be extended to unseen words. We propose a novel post-specialisation method that: a) preserves the useful linguistic knowledge for seen words; and b) propagates this external signal to unseen words in order to improve their vector representations as well. Our post-specialisation approach makes the non-linear specialisation function explicit in the form of a deep neural network by learning to predict specialised vectors from their…
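The idea in the abstract can be illustrated with a minimal sketch: fit a small non-linear network on seen words, mapping each input vector to its specialised vector, then apply the fitted network to unseen words. Everything below is an assumption made for illustration and is not taken from the paper or its repository: the inputs are assumed to be the original distributional vectors, `toy_specialise` is a hypothetical stand-in for a real specialisation step, the network has a single tanh hidden layer, and the objective is plain mean squared error chosen for brevity.

```python
import math
import random

def init_mlp(d_in, d_hidden, d_out, rng):
    """Small random weights for a one-hidden-layer network (illustrative sizes)."""
    W1 = [[rng.uniform(-0.5, 0.5) for _ in range(d_in)] for _ in range(d_hidden)]
    b1 = [0.0] * d_hidden
    W2 = [[rng.uniform(-0.5, 0.5) for _ in range(d_hidden)] for _ in range(d_out)]
    b2 = [0.0] * d_out
    return W1, b1, W2, b2

def forward(params, x):
    """Return the hidden activations and the predicted specialised vector."""
    W1, b1, W2, b2 = params
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b) for row, b in zip(W1, b1)]
    y = [sum(w * hi for w, hi in zip(row, h)) + b for row, b in zip(W2, b2)]
    return h, y

def mse(params, X, Y):
    """Mean over examples of the squared error summed across dimensions."""
    total = 0.0
    for x, t in zip(X, Y):
        _, y = forward(params, x)
        total += sum((yi - ti) ** 2 for yi, ti in zip(y, t))
    return total / len(X)

def train(params, X, Y, lr=0.02, epochs=300):
    """Per-example gradient descent on squared error; updates weights in place."""
    W1, b1, W2, b2 = params
    for _ in range(epochs):
        for x, t in zip(X, Y):
            h, y = forward(params, x)
            dy = [2.0 * (yi - ti) for yi, ti in zip(y, t)]
            # Backpropagate through W2 before W2 is updated.
            dh = [sum(dy[k] * W2[k][j] for k in range(len(dy))) * (1.0 - h[j] ** 2)
                  for j in range(len(h))]
            for k in range(len(dy)):
                for j in range(len(h)):
                    W2[k][j] -= lr * dy[k] * h[j]
                b2[k] -= lr * dy[k]
            for j in range(len(h)):
                for i in range(len(x)):
                    W1[j][i] -= lr * dh[j] * x[i]
                b1[j] -= lr * dh[j]
    return params

def toy_specialise(x):
    """Hypothetical stand-in for specialising a seen word with lexical constraints."""
    return [math.tanh(x[0] + 0.5 * x[1]), math.tanh(x[1] - 0.5 * x[0])]

rng = random.Random(0)
# Seen words: both the input vector and its specialised vector are available.
seen_X = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(40)]
seen_Y = [toy_specialise(x) for x in seen_X]
# Unseen words: only the input vector is available.
unseen_X = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(10)]

params = init_mlp(2, 8, 2, rng)
loss_before = mse(params, seen_X, seen_Y)
train(params, seen_X, seen_Y)
loss_after = mse(params, seen_X, seen_Y)

# Propagate the learned mapping to words that no lexical constraint covers.
unseen_specialised = [forward(params, x)[1] for x in unseen_X]
```

After training, `unseen_specialised` holds one mapped vector per unseen word. In this toy setting the mapping is learned only from seen words, which mirrors the division of labour the abstract describes: the lexicon supervises the seen part of the vocabulary, and the network carries that signal to the rest.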

Code & Models

Repositories

cambridgeltl/post-specialisation (TensorFlow, official)
