NOTE: Solution for KDD-CUP 2021 WikiKG90M-LSC
Weiyue Su, Zeyang Fang, Hui Zhong, Huijuan Wang, Siming Dai, Zhengjie, Huang, Yunsheng Shi, Shikun Feng, Zeyu Chen

TL;DR
This paper presents an ensemble approach combining advanced knowledge graph embedding methods, including a modified OTE, DeepWalk, and statistical features, to improve triplet prediction in the large WikiKG90M knowledge graph for KDD Cup 2021.
Contribution
It introduces NOTE, a modified version of OTE, and demonstrates how combining multiple embedding methods and feature engineering enhances triplet prediction accuracy.
Findings
Ensemble of methods outperforms individual models.
Modified OTE (NOTE) improves embedding quality.
Feature engineering boosts prediction performance.
Abstract
WikiKG90M in KDD Cup 2021 is a large encyclopedic knowledge graph, which could benefit various downstream applications such as question answering and recommender systems. Participants are invited to complete the knowledge graph by predicting missing triplets. Recent representation learning methods have achieved great success on standard datasets like FB15k-237. Thus, we train the advanced algorithms in different domains to learn the triplets, including OTE, QuatE, RotatE and TransE. Significantly, we modified OTE into NOTE (short for Norm-OTE) for better performance. Besides, we use both the DeepWalk and the post-smoothing technique to capture the graph structure for supplementation. In addition to the representations, we also use various statistical probabilities among the head entities, the relations and the tail entities for the final prediction. Experimental results show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques
MethodsSelf-Adversarial Negative Sampling · RotatE · DeepWalk · TransE
