Understanding Negative Sampling in Graph Representation Learning
Zhen Yang, Ming Ding, Chang Zhou, Hongxia Yang, Jingren Zhou, Jie, Tang

TL;DR
This paper provides a theoretical analysis of negative sampling in graph representation learning, highlighting its importance and proposing a new method to improve sampling efficiency and effectiveness.
Contribution
It introduces a theoretical framework for negative sampling, deriving optimal distribution properties, and proposes MCNS to enhance sampling through self-contrast and Metropolis-Hastings.
Findings
Negative sampling distribution should be positively but sub-linearly correlated to positive distribution.
The proposed MCNS method improves sampling efficiency and robustness.
Experimental results outperform existing methods across multiple tasks and datasets.
Abstract
Graph representation learning has been extensively studied in recent years. Despite its potential in generating continuous embeddings for various networks, both the effectiveness and efficiency to infer high-quality representations toward large corpus of nodes are still challenging. Sampling is a critical point to achieve the performance goals. Prior arts usually focus on sampling positive node pairs, while the strategy for negative sampling is left insufficiently explored. To bridge the gap, we systematically analyze the role of negative sampling from the perspectives of both objective and risk, theoretically demonstrating that negative sampling is as important as positive sampling in determining the optimization objective and the resulted variance. To the best of our knowledge, we are the first to derive the theory and quantify that the negative sampling distribution should be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Complex Network Analysis Techniques
