HINormer: Representation Learning On Heterogeneous Information Networks with Graph Transformer
Qiheng Mao, Zemin Liu, Chenghao Liu, Jianling Sun

TL;DR
HINormer introduces a novel graph transformer model for heterogeneous information networks, effectively capturing structural and relation-based information to improve node representations beyond existing methods.
Contribution
This paper presents HINormer, the first to adapt graph transformers for HINs, utilizing large-range aggregation with local and relation encoders for enhanced representation learning.
Findings
Outperforms state-of-the-art on four HIN benchmarks
Effectively captures structural and heterogeneous information
Demonstrates superior node classification accuracy
Abstract
Recent studies have highlighted the limitations of message-passing based graph neural networks (GNNs), e.g., limited model expressiveness, over-smoothing, over-squashing, etc. To alleviate these issues, Graph Transformers (GTs) have been proposed which work in the paradigm that allows message passing to a larger coverage even across the whole graph. Hinging on the global range attention mechanism, GTs have shown a superpower for representation learning on homogeneous graphs. However, the investigation of GTs on heterogeneous information networks (HINs) is still under-exploited. In particular, on account of the existence of heterogeneity, HINs show distinct data characteristics and thus require different treatment. To bridge this gap, in this paper we investigate the representation learning on HINs with Graph Transformer, and propose a novel model named HINormer, which capitalizes on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Recommender Systems and Techniques · Topic Modeling
MethodsAttention Is All You Need · Linear Layer · Dropout · Byte Pair Encoding · Adam · Multi-Head Attention · Residual Connection · Layer Normalization · Softmax · Label Smoothing
