Assessing the Effects of Hyperparameters on Knowledge Graph Embedding Quality
Oliver Lloyd, Yi Liu, Tom Gaunt

TL;DR
This paper investigates how different hyperparameters affect the quality of knowledge graph embeddings using Sobol sensitivity analysis, revealing dataset-dependent sensitivities and proposing a leakage-robust UMLS variant.
Contribution
It applies Sobol sensitivity analysis to identify hyperparameters that significantly impact embedding quality and introduces UMLS-43, a leakage-robust knowledge graph variant.
Findings
Hyperparameter sensitivities vary across datasets.
Some hyperparameters can be eliminated without affecting quality.
UMLS-43 reduces data leakage risks.
Abstract
Embedding knowledge graphs into low-dimensional spaces is a popular method for applying approaches, such as link prediction or node classification, to these databases. This embedding process is very costly in terms of both computational time and space. Part of the reason for this is the optimisation of hyperparameters, which involves repeatedly sampling, by random, guided, or brute-force selection, from a large hyperparameter space and testing the resulting embeddings for their quality. However, not all hyperparameters in this search space will be equally important. In fact, with prior knowledge of the relative importance of the hyperparameters, some could be eliminated from the search altogether without significantly impacting the overall quality of the outputted embeddings. To this end, we ran a Sobol sensitivity analysis to evaluate the effects of tuning different hyperparameters on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Data Quality and Management · Data Mining Algorithms and Applications
