Joint Word Representation Learning using a Corpus and a Semantic Lexicon
Danushka Bollegala, Alsuhaibani Mohammed, Takanori Maehara, Ken-ichi, Kawarabayashi

TL;DR
This paper introduces a joint learning approach that combines corpus-based co-occurrence data with semantic lexicon constraints to produce improved word representations for NLP tasks.
Contribution
It proposes a novel method that integrates semantic lexicon information into word embedding learning, enhancing the quality of representations over previous approaches.
Findings
Significantly outperforms previous methods on semantic similarity tasks.
Achieves better results on word analogy benchmarks.
Effectively incorporates semantic relations into embeddings.
Abstract
Methods for learning word representations using large text corpora have received much attention lately due to their impressive performance in numerous natural language processing (NLP) tasks such as, semantic similarity measurement, and word analogy detection. Despite their success, these data-driven word representation learning methods do not consider the rich semantic relational structure between words in a co-occurring context. On the other hand, already much manual effort has gone into the construction of semantic lexicons such as the WordNet that represent the meanings of words by defining the various relationships that exist among the words in a language. We consider the question, can we improve the word representations learnt using a corpora by integrating the knowledge from semantic lexicons?. For this purpose, we propose a joint word representation learning method that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
