TinyKG: Memory-Efficient Training Framework for Knowledge Graph Neural   Recommender Systems

Huiyuan Chen; Xiaoting Li; Kaixiong Zhou; Xia Hu; Chin-Chia Michael; Yeh; Yan Zheng; Hao Yang

arXiv:2212.04540·cs.IR·December 12, 2022

TinyKG: Memory-Efficient Training Framework for Knowledge Graph Neural Recommender Systems

Huiyuan Chen, Xiaoting Li, Kaixiong Zhou, Xia Hu, Chin-Chia Michael, Yeh, Yan Zheng, Hao Yang

PDF

TL;DR

TinyKG is a GPU-based training framework for Knowledge Graph Neural Networks that significantly reduces memory usage through quantization, enabling deployment in memory-limited environments with minimal accuracy loss.

Contribution

We introduce TinyKG, a novel memory-efficient training method for KGNNs using activation quantization, facilitating scalable training on resource-constrained devices.

Findings

01

Memory footprint reduced by 7x with INT2 quantization

02

Achieves only 2% accuracy loss

03

Enables deployment on memory-constrained devices

Abstract

There has been an explosion of interest in designing various Knowledge Graph Neural Networks (KGNNs), which achieve state-of-the-art performance and provide great explainability for recommendation. The promising performance is mainly resulting from their capability of capturing high-order proximity messages over the knowledge graphs. However, training KGNNs at scale is challenging due to the high memory usage. In the forward pass, the automatic differentiation engines (\textsl{e.g.}, TensorFlow/PyTorch) generally need to cache all intermediate activation maps in order to compute gradients in the backward pass, which leads to a large GPU memory footprint. Existing work solves this problem by utilizing multi-GPU distributed frameworks. Nonetheless, this poses a practical challenge when seeking to deploy KGNNs in memory-constrained environments, especially for industry-scale graphs. Here…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.