Towards Quantized Model Parallelism for Graph-Augmented MLPs Based on Gradient-Free ADMM Framework
Junxiang Wang, Hongyi Li, Zheng Chai, Yongchao Wang, Yue Cheng and, Liang Zhao

TL;DR
This paper introduces a novel parallel graph deep learning framework using ADMM that enables model parallelism for GA-MLP models, reducing communication costs and improving efficiency while maintaining convergence and performance.
Contribution
It proposes the pdADMM-G framework and its quantized version, pdADMM-G-Q, for efficient model parallelism in GA-MLP models, addressing communication and efficiency challenges.
Findings
Achieves convergence with sublinear rate $o(1/k)$.
Demonstrates significant speedup and improved accuracy over state-of-the-art methods.
Reduces communication overheads by up to 45% without performance loss.
Abstract
While Graph Neural Networks (GNNs) are popular in the deep learning community, they suffer from several challenges including over-smoothing, over-squashing, and gradient vanishing. Recently, a series of models have attempted to relieve these issues by first augmenting the node features and then imposing node-wise functions based on Multi-Layer Perceptron (MLP), which are widely referred to as GA-MLP models. However, while GA-MLP models enjoy deeper architectures for better accuracy, their efficiency largely deteriorates. Moreover, popular acceleration techniques such as stochastic-version or data-parallelism cannot be effectively applied due to the dependency among samples (i.e., nodes) in graphs. To address these issues, in this paper, instead of data parallelism, we propose a parallel graph deep learning Alternating Direction Method of Multipliers (pdADMM-G) framework to achieve model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications
MethodsStochastic Gradient Descent
