Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints
Nikola Mrk\v{s}i\'c, Ivan Vuli\'c, Diarmuid \'O S\'eaghdha, Ira, Leviant, Roi Reichart, Milica Ga\v{s}i\'c, Anna Korhonen, Steve Young

TL;DR
This paper introduces Attract-Repel, an algorithm that enhances word vector semantic quality using lexical constraints, improving cross-lingual semantic tasks and dialogue state tracking across multiple languages.
Contribution
The paper presents a novel algorithm, Attract-Repel, that injects monolingual and cross-lingual constraints into word vectors, producing high-quality, semantically specialised multilingual vector spaces.
Findings
State-of-the-art results on semantic similarity in six languages
Improved dialogue state tracking performance across multiple languages
Cross-lingual vector spaces facilitate multilingual DST training
Abstract
We present Attract-Repel, an algorithm for improving the semantic quality of word vectors by injecting constraints extracted from lexical resources. Attract-Repel facilitates the use of constraints from mono- and cross-lingual resources, yielding semantically specialised cross-lingual vector spaces. Our evaluation shows that the method can make use of existing cross-lingual lexicons to construct high-quality vector spaces for a plethora of different languages, facilitating semantic transfer from high- to lower-resource ones. The effectiveness of our approach is demonstrated with state-of-the-art results on semantic similarity datasets in six languages. We next show that Attract-Repel-specialised vectors boost performance in the downstream task of dialogue state tracking (DST) across multiple languages. Finally, we show that cross-lingual vector spaces produced by our algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
