Pre-Trained Models for Heterogeneous Information Networks
Yang Fang, Xiang Zhao, Yifan Chen, Weidong Xiao, Maarten de Rijke

TL;DR
This paper introduces PF-HIN, a self-supervised pre-training framework for heterogeneous information networks that enhances transferability and efficiency across multiple downstream tasks using transformer encoders.
Contribution
The paper proposes a novel pre-training and fine-tuning framework for heterogeneous information networks that reduces training costs and improves performance on various tasks.
Findings
PF-HIN outperforms state-of-the-art methods on four benchmark tasks.
Pre-training with self-supervised tasks improves transferability.
Transformer-based architecture effectively captures network features.
Abstract
In network representation learning we learn how to represent heterogeneous information networks in a low-dimensional space so as to facilitate effective search, classification, and prediction solutions. Previous network representation learning methods typically require sufficient task-specific labeled data to address domain-specific problems. The trained model usually cannot be transferred to out-of-domain datasets. We propose a self-supervised pre-training and fine-tuning framework, PF-HIN, to capture the features of a heterogeneous information network. Unlike traditional network representation learning models that have to train the entire model all over again for every downstream task and dataset, PF-HIN only needs to fine-tune the model and a small number of extra task-specific parameters, thus improving model efficiency and effectiveness. During pre-training, we first transform the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Network Security and Intrusion Detection · Topic Modeling
MethodsLinear Layer · Tanh Activation · Sigmoid Activation · Adam · Multi-Head Attention · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Attention Is All You Need · Attention Dropout
