SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations
Md Saidul Hoque Anik, Ariful Azad

TL;DR
SparseTransX introduces a sparse matrix operation-based framework that significantly accelerates training of translation-based knowledge graph embeddings, reducing time and memory usage across various models and datasets.
Contribution
The paper presents a novel sparse matrix kernel approach for KG embedding training, achieving substantial speedups and low memory footprint, applicable to multiple models.
Findings
Up to 5.3x speedup on CPU
Up to 4.2x speedup on GPU
Consistent performance across datasets
Abstract
Knowledge graph (KG) learning offers a powerful framework for generating new knowledge and making inferences. Training KG embedding can take a significantly long time, especially for larger datasets. Our analysis shows that the gradient computation of embedding is one of the dominant functions in the translation-based KG embedding training loop. We address this issue by replacing the core embedding computation with SpMM (Sparse-Dense Matrix Multiplication) kernels. This allows us to unify multiple scatter (and gather) operations as a single operation, reducing training time and memory usage. We create a general framework for training KG models using sparse kernels and implement four models, namely TransE, TransR, TransH, and TorusE. Our sparse implementations exhibit up to 5.3x speedup on the CPU and up to 4.2x speedup on the GPU with a significantly low GPU memory footprint. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Machine Learning in Healthcare
MethodsSelf-Adversarial Negative Sampling · RotatE · TransE
