IGLU: Efficient GCN Training via Lazy Updates

S Deepak Narayanan; Aditya Sinha; Prateek Jain; Purushottam Kar,; Sundararajan Sellamanickam

arXiv:2109.13995·cs.LG·April 5, 2022

IGLU: Efficient GCN Training via Lazy Updates

S Deepak Narayanan, Aditya Sinha, Prateek Jain, Purushottam Kar,, Sundararajan Sellamanickam

PDF

Open Access 1 Repo 1 Video

TL;DR

IGLU is a novel method for training multi-layer GCNs that caches intermediate computations to enable lazy updates, significantly reducing compute costs while maintaining or improving accuracy.

Contribution

IGLU introduces a caching-based lazy update mechanism for GCN training that reduces computation and maintains convergence despite bounded bias.

Findings

01

Up to 88% reduction in compute cost

02

Achieves up to 1.2% better accuracy

03

Converges to a first-order saddle point

Abstract

Training multi-layer Graph Convolution Networks (GCN) using standard SGD techniques scales poorly as each descent step ends up updating node embeddings for a large portion of the graph. Recent attempts to remedy this sub-sample the graph that reduces compute but introduce additional variance and may offer suboptimal performance. This paper develops the IGLU method that caches intermediate computations at various GCN layers thus enabling lazy updates that significantly reduce the compute cost of descent. IGLU introduces bounded bias into the gradients but nevertheless converges to a first-order saddle point under standard assumptions such as objective smoothness. Benchmark experiments show that IGLU offers up to 1.2% better accuracy despite requiring up to 88% less compute.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sdeepaknarayanan/iglu
tfOfficial

Videos

IGLU: Efficient GCN Training via Lazy Updates· slideslive

Taxonomy

TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Caching and Content Delivery

MethodsStochastic Gradient Descent · Graph Convolutional Network · Convolution