MulDE: Multi-teacher Knowledge Distillation for Low-dimensional   Knowledge Graph Embeddings

Kai Wang; Yu Liu; Qian Ma; Quan Z. Sheng

arXiv:2010.07152·cs.AI·April 2, 2021

MulDE: Multi-teacher Knowledge Distillation for Low-dimensional Knowledge Graph Embeddings

Kai Wang, Yu Liu, Qian Ma, Quan Z. Sheng

PDF

TL;DR

MulDE is a knowledge distillation framework that enhances low-dimensional knowledge graph embeddings by leveraging multiple hyperbolic teacher models, improving performance and training efficiency.

Contribution

Introduces a novel multi-teacher distillation method with iterative strategies and adaptive mechanisms for low-dimensional KGE models.

Findings

01

Distilled 32-dimensional models outperform some high-dimensional methods.

02

MulDE improves training speed and accuracy of low-dimensional KGE models.

03

Effective knowledge transfer from multiple hyperbolic teachers enhances low-dimensional embeddings.

Abstract

Link prediction based on knowledge graph embeddings (KGE) aims to predict new triples to automatically construct knowledge graphs (KGs). However, recent KGE models achieve performance improvements by excessively increasing the embedding dimensions, which may cause enormous training costs and require more storage space. In this paper, instead of training high-dimensional models, we propose MulDE, a novel knowledge distillation framework, which includes multiple low-dimensional hyperbolic KGE models as teachers and two student components, namely Junior and Senior. Under a novel iterative distillation strategy, the Junior component, a low-dimensional KGE model, asks teachers actively based on its preliminary prediction results, and the Senior component integrates teachers' knowledge adaptively to train the Junior component based on two mechanisms: relation-specific scaling and contrast…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation