Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex   Logical Queries

Xiao Liu; Shiyu Zhao; Kai Su; Yukuo Cen; Jiezhong Qiu; Mengdi Zhang,; Wei Wu; Yuxiao Dong; Jie Tang

arXiv:2208.07638·cs.LG·August 17, 2022

Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

Xiao Liu, Shiyu Zhao, Kai Su, Yukuo Cen, Jiezhong Qiu, Mengdi Zhang,, Wei Wu, Yuxiao Dong, Jie Tang

PDF

1 Repo

TL;DR

This paper introduces kgTransformer, a novel knowledge graph transformer model with masked pre-training, capable of complex logical reasoning and providing explainability, outperforming existing methods on multiple benchmarks.

Contribution

The paper proposes a new transformer-based model with masked pre-training and MoE to handle complex logical queries over knowledge graphs, improving transferability and interpretability.

Findings

01

Outperforms KG embedding baselines on reasoning tasks

02

Handles complex logical queries with high accuracy

03

Provides explainable reasoning paths

Abstract

Knowledge graph (KG) embeddings have been a mainstream approach for reasoning over incomplete KGs. However, limited by their inherently shallow and static architectures, they can hardly deal with the rising focus on complex logical queries, which comprise logical operators, imputed edges, multiple source entities, and unknown intermediate entities. In this work, we present the Knowledge Graph Transformer (kgTransformer) with masked pre-training and fine-tuning strategies. We design a KG triple transformation method to enable Transformer to handle KGs, which is further strengthened by the Mixture-of-Experts (MoE) sparse activation. We then formulate the complex logical queries as masked prediction and introduce a two-stage masked pre-training strategy to improve transferability and generalizability. Extensive experiments on two benchmarks demonstrate that kgTransformer can consistently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thudm/kgtransformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Absolute Position Encodings · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Residual Connection · Adam