word2vec Explained: deriving Mikolov et al.'s negative-sampling   word-embedding method

Yoav Goldberg; Omer Levy

arXiv:1402.3722·cs.CL·February 18, 2014·1.3k cites

word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method

Yoav Goldberg, Omer Levy

PDF

Open Access 5 Repos

TL;DR

This paper clarifies the negative sampling method used in Mikolov et al.'s word2vec model, making the underlying equations and rationale more accessible to researchers and practitioners.

Contribution

It provides a detailed explanation and derivation of the negative sampling technique in Mikolov et al.'s word2vec, which was previously cryptic.

Findings

01

Clarified the mathematical derivation of negative sampling

02

Improved understanding of word2vec training process

03

Enhanced accessibility of the model's core equations

Abstract

The word2vec software of Tomas Mikolov and colleagues (https://code.google.com/p/word2vec/ ) has gained a lot of traction lately, and provides state-of-the-art word embeddings. The learning models behind the software are described in two research papers. We found the description of the models in these papers to be somewhat cryptic and hard to follow. While the motivations and presentation may be obvious to the neural-networks language-modeling crowd, we had to struggle quite a bit to figure out the rationale behind the equations. This note is an attempt to explain equation (4) (negative sampling) in "Distributed Representations of Words and Phrases and their Compositionality" by Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado and Jeffrey Dean.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques