A Hierarchical N-Gram Framework for Zero-Shot Link Prediction
Mingchen Li, Junfan Chen, Samuel Mensah, Nikolaos Aletras and, Xiulong Yang, Yang Ye

TL;DR
This paper introduces a hierarchical n-gram framework using a GramTransformer to improve zero-shot link prediction in knowledge graphs, effectively handling out-of-vocabulary issues and achieving state-of-the-art results.
Contribution
It proposes a novel hierarchical n-gram graph model and GramTransformer for robust relation embedding in zero-shot link prediction.
Findings
Achieved state-of-the-art performance on two ZSLP datasets.
Effectively handles out-of-vocabulary relation names.
Demonstrates robustness over existing methods.
Abstract
Due to the incompleteness of knowledge graphs (KGs), zero-shot link prediction (ZSLP) which aims to predict unobserved relations in KGs has attracted recent interest from researchers. A common solution is to use textual features of relations (e.g., surface name or textual descriptions) as auxiliary information to bridge the gap between seen and unseen relations. Current approaches learn an embedding for each word token in the text. These methods lack robustness as they suffer from the out-of-vocabulary (OOV) problem. Meanwhile, models built on character n-grams have the capability of generating expressive representations for OOV words. Thus, in this paper, we propose a Hierarchical N-Gram framework for Zero-Shot Link Prediction (HNZSLP), which considers the dependencies among character n-grams of the relation surface name for ZSLP. Our approach works by first constructing a hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Dropout · Label Smoothing · Adam · Multi-Head Attention · Residual Connection · Absolute Position Encodings · Byte Pair Encoding
