TL;DR
GraphSMOTE introduces a novel embedding-based over-sampling framework for imbalanced node classification in graphs, effectively synthesizing new minority samples with relation information to improve GNN performance.
Contribution
The paper presents GraphSMOTE, a new method that extends synthetic minority over-sampling to graph data by constructing an embedding space and training an edge generator.
Findings
Outperforms baseline methods on three datasets.
Effectively synthesizes realistic minority class samples.
Improves GNN classification accuracy significantly.
Abstract
Node classification is an important research topic in graph learning. Graph neural networks (GNNs) have achieved state-of-the-art performance of node classification. However, existing GNNs address the problem where node samples for different classes are balanced; while for many real-world scenarios, some classes may have much fewer instances than others. Directly training a GNN classifier in this case would under-represent samples from those minority classes and result in sub-optimal performance. Therefore, it is very important to develop GNNs for imbalanced node classification. However, the work on this is rather limited. Hence, we seek to extend previous imbalanced learning techniques for i.i.d data to the imbalanced node classification task to facilitate GNN classifiers. In particular, we choose to adopt synthetic minority over-sampling algorithms, as they are found to be the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
