Class-Imbalanced Graph Learning without Class Rebalancing
Zhining Liu, Ruizhong Qiu, Zhichen Zeng, Hyunsik Yoo, David Zhou, Zhe, Xu, Yada Zhu, Kommy Weldemariam, Jingrui He, Hanghang Tong

TL;DR
This paper identifies topological causes of class imbalance bias in graph learning and introduces BAT, a lightweight augmentation framework that improves performance without class rebalancing, complementing existing methods.
Contribution
The paper reveals topological factors causing class imbalance bias and proposes BAT, a novel augmentation method that enhances existing class rebalancing techniques without requiring reweighting or resampling.
Findings
BAT achieves up to 46.27% performance improvement
BAT reduces bias by up to 72.74%
Compatible with existing class rebalancing methods
Abstract
Class imbalance is prevalent in real-world node classification tasks and poses great challenges for graph learning models. Most existing studies are rooted in a class-rebalancing (CR) perspective and address class imbalance with class-wise reweighting or resampling. In this work, we approach the root cause of class-imbalance bias from an topological paradigm. Specifically, we theoretically reveal two fundamental phenomena in the graph topology that greatly exacerbate the predictive bias stemming from class imbalance. On this basis, we devise a lightweight topological augmentation framework BAT to mitigate the class-imbalance bias without class rebalancing. Being orthogonal to CR, BAT can function as an efficient plug-and-play module that can be seamlessly combined with and significantly boost existing CR techniques. Systematic experiments on real-world imbalanced graph learning tasks…
Peer Reviews
Decision·ICML 2024 Poster
1. Class imbalance is an important issue in the field of graph imbalance learning, which requires deep investigation.
1. The AMP and DMP phenomena are studied in many previous works. The AMP is basically the heterophily issue studied in previous heterophily GNN and graph anomaly detection literature. The DMP is basically the information insufficient issue studied in previous topology imbalance literature. 2. I find the theoretical analysis has nothing to do with the model design. In the theoretical analysis, this work only analyzes the relation between the imbalance ratio and the severity of AMP and DMP. Howeve
S1. Formulas that accurately describe the AMP and DMP problems were derived, providing precise definitions for these problems. S2. Extensive experiments consistently show that the proposed post-processing module significantly enhances the learning effectiveness of the current model across multiple metrics and base models. S3. The writing style is smooth and coherent.
W1. The lack of comparison with other post-processing modules for class imbalance graph learning, such as the classical Residual Propagation method, undermines the persuasiveness of the proposed method's effectiveness. W2. The study lacks a comparison between the proposed method and the predictive performance of the base model on balanced data, which diminishes its persuasiveness. W3. The absence of a comparison with the predictions of the base model on balanced data weakens the persuasivenes
1. The authors provide their codes. 2. It provides some theoretical support for the proposed model. 3. It tests on several widely-used datasets, and the proposed method can sometimes beat the existing methods.
1. The proposed method seems to be meaningless at times. As shown in Table, sometimes the original methods (such as APPNP and GPRGNN) can beat either +ToBE0 or + ToBE1. More over, as shown in Tables 7 and 8, lots of baselines (Vanilla, Reweight, ReNode, and GSMOTE) can sometimes beat either +ToBE0 or + ToBE1. 2. Some grammatical errors, like 1) groundtruth labels –> “ground-truth”; 2) Coauthor networks-> “co-author”; and 3) “Fig. 5 compares” “Figure”.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Computing and Algorithms · Machine Learning in Bioinformatics · Advanced Graph Neural Networks
