Joint Word Representation Learning using a Corpus and a Semantic Lexicon

Danushka Bollegala; Alsuhaibani Mohammed; Takanori Maehara; Ken-ichi; Kawarabayashi

arXiv:1511.06438·cs.CL·November 23, 2015·40 cites

Joint Word Representation Learning using a Corpus and a Semantic Lexicon

Danushka Bollegala, Alsuhaibani Mohammed, Takanori Maehara, Ken-ichi, Kawarabayashi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a joint learning approach that combines corpus-based co-occurrence data with semantic lexicon constraints to produce improved word representations for NLP tasks.

Contribution

It proposes a novel method that integrates semantic lexicon information into word embedding learning, enhancing the quality of representations over previous approaches.

Findings

01

Significantly outperforms previous methods on semantic similarity tasks.

02

Achieves better results on word analogy benchmarks.

03

Effectively incorporates semantic relations into embeddings.

Abstract

Methods for learning word representations using large text corpora have received much attention lately due to their impressive performance in numerous natural language processing (NLP) tasks such as, semantic similarity measurement, and word analogy detection. Despite their success, these data-driven word representation learning methods do not consider the rich semantic relational structure between words in a co-occurring context. On the other hand, already much manual effort has gone into the construction of semantic lexicons such as the WordNet that represent the meanings of words by defining the various relationships that exist among the words in a language. We consider the question, can we improve the word representations learnt using a corpora by integrating the knowledge from semantic lexicons?. For this purpose, we propose a joint word representation learning method that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Bollegala/jointreps
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques