GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional Networks
Christodoulos Peltekis, Giorgos Dimitrakopoulos

TL;DR
This paper introduces GCN-ABFT, a low-cost error detection method for GCN accelerators that computes a single checksum for the entire three-matrix product of a GCN layer, reducing checking overhead while maintaining fault-detection accuracy.
Contribution
GCN-ABFT is the first approach to directly compute a checksum for the full three-matrix GCN layer, lowering error detection costs in hardware accelerators.
Findings
Reduces checksum computation operations by over 21% on average.
Fault-injection analysis shows that fault-detection accuracy is maintained.
Applicable to representative GCN applications.
Abstract
Graph convolutional networks (GCNs) are popular for building machine-learning applications for graph-structured data. This widespread adoption has led to the development of specialized GCN hardware accelerators. In this work, we address a key architectural challenge for GCN accelerators: how to detect errors in GCN computations arising from random hardware faults at the lowest computation cost. Each GCN layer performs a graph convolution, mathematically equivalent to multiplying three matrices, computed through two separate matrix multiplications. Existing Algorithm-based Fault Tolerance (ABFT) techniques can check the results of individual matrix multiplications. For a GCN layer, however, this check must be performed twice. To avoid this overhead, this work introduces GCN-ABFT, which directly calculates a checksum for the entire three-matrix product within a single GCN layer, providing a…
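The idea of checking a three-matrix product with one checksum can be illustrated with a small sketch. It relies on the identity 1^T (A H W) 1 = (1^T A) H (W 1): the sum of all entries of the layer output can be predicted from the column sums of A and the row sums of W, without a separate check on the intermediate product. This is not the paper's exact scheme. The function names, the matrix names adj, feats and weights, and the integer test data are all assumptions made for illustration, and real floating-point hardware would need a tolerance rather than an exact comparison.

```python
# Illustrative sketch only: names and data are hypothetical, not from the paper.

def matmul(x, y):
    """Plain-Python matrix product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]

def predicted_checksum(adj, feats, weights):
    """Compute (1^T A) H (W 1) directly from the three input matrices."""
    col_sums_adj = [sum(col) for col in zip(*adj)]   # 1^T A
    row_sums_w = [sum(row) for row in weights]       # W 1
    # H (W 1): one dot product per row of the feature matrix
    h_w1 = [sum(h * w for h, w in zip(row, row_sums_w)) for row in feats]
    return sum(a * x for a, x in zip(col_sums_adj, h_w1))

def actual_checksum(out):
    """Sum of all entries of the computed layer output."""
    return sum(sum(row) for row in out)

def gcn_layer_product(adj, feats, weights):
    """Two-step evaluation A (H W), mirroring two separate multiplications."""
    return matmul(adj, matmul(feats, weights))

adj = [[1, 1, 0], [1, 1, 1], [0, 1, 1]]   # 3x3 adjacency with self-loops
feats = [[1, 2], [3, 4], [5, 6]]          # 3 nodes, 2 input features
weights = [[1, 0, 2], [0, 1, 3]]          # 2 input -> 3 output features

out = gcn_layer_product(adj, feats, weights)
expected = predicted_checksum(adj, feats, weights)
fault_free = actual_checksum(out) == expected

# Emulate a random hardware fault by corrupting one output element.
faulty = [row[:] for row in out]
faulty[1][2] += 4
fault_detected = actual_checksum(faulty) != expected

print(expected, fault_free, fault_detected)
```

In this sketch a single comparison covers both multiplications, whereas applying classic per-multiplication ABFT would need one check after H W and another after A (H W). As with any sum-based checksum, faults whose effects cancel in the total would go unnoticed.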
Taxonomy
Topics: Advanced Graph Neural Networks · Cloud Computing and Resource Management · Software-Defined Networks and 5G
Methods: Graph Convolutional Network
