Pre-Trained Models for Heterogeneous Information Networks

Yang Fang; Xiang Zhao; Yifan Chen; Weidong Xiao; Maarten de Rijke

arXiv:2007.03184·cs.AI·May 19, 2021

Pre-Trained Models for Heterogeneous Information Networks

Yang Fang, Xiang Zhao, Yifan Chen, Weidong Xiao, Maarten de Rijke

PDF

Open Access

TL;DR

This paper introduces PF-HIN, a self-supervised pre-training framework for heterogeneous information networks that enhances transferability and efficiency across multiple downstream tasks using transformer encoders.

Contribution

The paper proposes a novel pre-training and fine-tuning framework for heterogeneous information networks that reduces training costs and improves performance on various tasks.

Findings

01

PF-HIN outperforms state-of-the-art methods on four benchmark tasks.

02

Pre-training with self-supervised tasks improves transferability.

03

Transformer-based architecture effectively captures network features.

Abstract

In network representation learning we learn how to represent heterogeneous information networks in a low-dimensional space so as to facilitate effective search, classification, and prediction solutions. Previous network representation learning methods typically require sufficient task-specific labeled data to address domain-specific problems. The trained model usually cannot be transferred to out-of-domain datasets. We propose a self-supervised pre-training and fine-tuning framework, PF-HIN, to capture the features of a heterogeneous information network. Unlike traditional network representation learning models that have to train the entire model all over again for every downstream task and dataset, PF-HIN only needs to fine-tune the model and a small number of extra task-specific parameters, thus improving model efficiency and effectiveness. During pre-training, we first transform the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Network Security and Intrusion Detection · Topic Modeling

MethodsLinear Layer · Tanh Activation · Sigmoid Activation · Adam · Multi-Head Attention · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Attention Is All You Need · Attention Dropout