TinyKG: Memory-Efficient Training Framework for Knowledge Graph Neural Recommender Systems
Huiyuan Chen, Xiaoting Li, Kaixiong Zhou, Xia Hu, Chin-Chia Michael, Yeh, Yan Zheng, Hao Yang

TL;DR
TinyKG is a GPU-based training framework for Knowledge Graph Neural Networks that significantly reduces memory usage through quantization, enabling deployment in memory-limited environments with minimal accuracy loss.
Contribution
We introduce TinyKG, a novel memory-efficient training method for KGNNs using activation quantization, facilitating scalable training on resource-constrained devices.
Findings
Memory footprint reduced by 7x with INT2 quantization
Achieves only 2% accuracy loss
Enables deployment on memory-constrained devices
Abstract
There has been an explosion of interest in designing various Knowledge Graph Neural Networks (KGNNs), which achieve state-of-the-art performance and provide great explainability for recommendation. The promising performance is mainly resulting from their capability of capturing high-order proximity messages over the knowledge graphs. However, training KGNNs at scale is challenging due to the high memory usage. In the forward pass, the automatic differentiation engines (\textsl{e.g.}, TensorFlow/PyTorch) generally need to cache all intermediate activation maps in order to compute gradients in the backward pass, which leads to a large GPU memory footprint. Existing work solves this problem by utilizing multi-GPU distributed frameworks. Nonetheless, this poses a practical challenge when seeking to deploy KGNNs in memory-constrained environments, especially for industry-scale graphs. Here…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
