Comprehend DeepWalk as Matrix Factorization

Cheng Yang; Zhiyuan Liu

arXiv:1501.00358·cs.LG·January 5, 2015·20 cites

Comprehend DeepWalk as Matrix Factorization

Cheng Yang, Zhiyuan Liu

PDF

Open Access

TL;DR

This paper reveals that DeepWalk, a popular node embedding algorithm for graphs, can be understood as matrix factorization of a specific matrix related to random walk probabilities, connecting it to word embedding techniques.

Contribution

It formally proves that DeepWalk is equivalent to factorizing a matrix derived from random walk probabilities, bridging graph embedding and matrix factorization methods.

Findings

01

DeepWalk factors a matrix of random walk probabilities.

02

The matrix M entries are logarithms of average walk probabilities.

03

This insight unifies DeepWalk with matrix factorization techniques.

Abstract

Word2vec, as an efficient tool for learning vector representation of words has shown its effectiveness in many natural language processing tasks. Mikolov et al. issued Skip-Gram and Negative Sampling model for developing this toolbox. Perozzi et al. introduced the Skip-Gram model into the study of social network for the first time, and designed an algorithm named DeepWalk for learning node embedding on a graph. We prove that the DeepWalk algorithm is actually factoring a matrix M where each entry M_{ij} is logarithm of the average probability that node i randomly walks to node j in fix steps.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsDeepWalk