Synthetic Over-sampling for Imbalanced Node Classification with Graph Neural Networks
Tianxiang Zhao, Xiang Zhang, Suhang Wang

TL;DR
This paper proposes a novel graph over-sampling method that synthesizes high-quality minority class nodes in an embedding space, addressing class imbalance in GNNs and improving node classification performance.
Contribution
It introduces a new over-sampling technique that generates synthetic nodes considering graph structure and attributes, enhancing class balance in GNN training.
Findings
Improved classification accuracy on imbalanced graph datasets
Effective generation of high-quality synthetic minority nodes
Enhanced data efficiency through mixed node synthesis
Abstract
In recent years, graph neural networks (GNNs) have achieved state-of-the-art performance for node classification. However, most existing GNNs would suffer from the graph imbalance problem. In many real-world scenarios, node classes are imbalanced, with some majority classes making up most parts of the graph. The message propagation mechanism in GNNs would further amplify the dominance of those majority classes, resulting in sub-optimal classification performance. In this work, we seek to address this problem by generating pseudo instances of minority classes to balance the training data, extending previous over-sampling-based techniques. This task is non-trivial, as those techniques are designed with the assumption that instances are independent. Neglection of relation information would complicate this oversampling process. Furthermore, the node classification task typically takes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Artificial Intelligence in Healthcare · Brain Tumor Detection and Classification
