Towards Quantized Model Parallelism for Graph-Augmented MLPs Based on   Gradient-Free ADMM Framework

Junxiang Wang; Hongyi Li; Zheng Chai; Yongchao Wang; Yue Cheng and; Liang Zhao

arXiv:2105.09837·cs.LG·November 18, 2022

Towards Quantized Model Parallelism for Graph-Augmented MLPs Based on Gradient-Free ADMM Framework

Junxiang Wang, Hongyi Li, Zheng Chai, Yongchao Wang, Yue Cheng and, Liang Zhao

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel parallel graph deep learning framework using ADMM that enables model parallelism for GA-MLP models, reducing communication costs and improving efficiency while maintaining convergence and performance.

Contribution

It proposes the pdADMM-G framework and its quantized version, pdADMM-G-Q, for efficient model parallelism in GA-MLP models, addressing communication and efficiency challenges.

Findings

01

Achieves convergence with sublinear rate $o(1/k)$.

02

Demonstrates significant speedup and improved accuracy over state-of-the-art methods.

03

Reduces communication overheads by up to 45% without performance loss.

Abstract

While Graph Neural Networks (GNNs) are popular in the deep learning community, they suffer from several challenges including over-smoothing, over-squashing, and gradient vanishing. Recently, a series of models have attempted to relieve these issues by first augmenting the node features and then imposing node-wise functions based on Multi-Layer Perceptron (MLP), which are widely referred to as GA-MLP models. However, while GA-MLP models enjoy deeper architectures for better accuracy, their efficiency largely deteriorates. Moreover, popular acceleration techniques such as stochastic-version or data-parallelism cannot be effectively applied due to the dependency among samples (i.e., nodes) in graphs. To address these issues, in this paper, instead of data parallelism, we propose a parallel graph deep learning Alternating Direction Method of Multipliers (pdADMM-G) framework to achieve model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xianggebenben/pdADMM-G
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications

MethodsStochastic Gradient Descent