TL;DR
This paper introduces kgTransformer, a novel knowledge graph transformer model with masked pre-training, capable of complex logical reasoning and providing explainability, outperforming existing methods on multiple benchmarks.
Contribution
The paper proposes a new transformer-based model with masked pre-training and MoE to handle complex logical queries over knowledge graphs, improving transferability and interpretability.
Findings
Outperforms KG embedding baselines on reasoning tasks
Handles complex logical queries with high accuracy
Provides explainable reasoning paths
Abstract
Knowledge graph (KG) embeddings have been a mainstream approach for reasoning over incomplete KGs. However, limited by their inherently shallow and static architectures, they can hardly deal with the rising focus on complex logical queries, which comprise logical operators, imputed edges, multiple source entities, and unknown intermediate entities. In this work, we present the Knowledge Graph Transformer (kgTransformer) with masked pre-training and fine-tuning strategies. We design a KG triple transformation method to enable Transformer to handle KGs, which is further strengthened by the Mixture-of-Experts (MoE) sparse activation. We then formulate the complex logical queries as masked prediction and introduce a two-stage masked pre-training strategy to improve transferability and generalizability. Extensive experiments on two benchmarks demonstrate that kgTransformer can consistently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Absolute Position Encodings · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Residual Connection · Adam
