Masked Language Models are Good Heterogeneous Graph Generalizers
Jinyu Yang, Cheng Yang, Shanyuan Cui, Zeyuan Guo, Liangwei Yang, Muhan Zhang, Zhiqiang Zhang, Chuan Shi

TL;DR
This paper introduces MLM4HG, a novel masked language modeling approach that uses metapath-based textual sequences and task-unified templates to enhance the cross-domain and multi-task generalization of heterogeneous graph learning models.
Contribution
MLM4HG replaces HG token encoding with metapath-based text sequences and unified templates, improving generalization across tasks and domains in heterogeneous graph learning.
Findings
Outperforms state-of-the-art methods in cross-domain tasks
Effective in few-shot and zero-shot learning scenarios
Demonstrates superior generalization on four real-world datasets
Abstract
Heterogeneous graph neural networks (HGNNs) excel at capturing structural and semantic information in heterogeneous graphs (HGs), while struggling to generalize across domains and tasks. With the rapid advancement of large language models (LLMs), a recent study explored the integration of HGNNs with LLMs for generalizable heterogeneous graph learning. However, this approach typically encodes structural information as HG tokens using HGNNs, and disparities in embedding spaces between HGNNs and LLMs have been shown to bias the LLM's comprehension of HGs. Moreover, since these HG tokens are often derived from node-level tasks, the model's ability to generalize across tasks remains limited. To this end, we propose a simple yet effective Masked Language Modeling-based method, called MLM4HG. MLM4HG introduces metapath-based textual sequences instead of HG tokens to extract structural and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Machine Learning in Healthcare
MethodsHunger Games Search
