HittER: Hierarchical Transformers for Knowledge Graph Embeddings
Sanxing Chen, Xiaodong Liu, Jianfeng Gao, Jian Jiao, Ruofei Zhang and, Yangfeng Ji

TL;DR
HittER introduces a hierarchical Transformer model for knowledge graph embeddings that jointly captures entity-relation composition and relational context, achieving state-of-the-art results in link prediction and question answering tasks.
Contribution
The paper presents a novel hierarchical Transformer architecture for knowledge graph embedding that effectively models local neighborhood information and relational context.
Findings
HittER outperforms existing models on multiple link prediction datasets.
HittER integrated into BERT improves performance on Freebase question answering.
The hierarchical Transformer captures complex relational structures effectively.
Abstract
This paper examines the challenging problem of learning representations of entities and relations in a complex multi-relational knowledge graph. We propose HittER, a Hierarchical Transformer model to jointly learn Entity-relation composition and Relational contextualization based on a source entity's neighborhood. Our proposed model consists of two different Transformer blocks: the bottom block extracts features of each entity-relation pair in the local neighborhood of the source entity and the top block aggregates the relational information from outputs of the bottom block. We further design a masked entity prediction task to balance information from the relational context and the source entity itself. Experimental results show that HittER achieves new state-of-the-art results on multiple link prediction datasets. We additionally propose a simple approach to integrate HittER into BERT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Data Quality and Management
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Attention Is All You Need · Byte Pair Encoding · Adam · Dropout · Label Smoothing · Multi-Head Attention · Residual Connection
