Node Level Graph Autoencoder: Unified Pretraining for Textual Graph Learning
Wenbin Hu, Huihao Jing, Qi Hu, Haoran Li, Yangqiu Song

TL;DR
This paper introduces NodeGAE, a unified unsupervised autoencoder framework using language models for textual graph embedding, effectively capturing structure and text to improve downstream tasks.
Contribution
The paper presents a novel pretraining autoencoder that combines language models with graph structure awareness, offering a simple yet effective approach for textual graph representation learning.
Findings
Significantly improves GNN performance on textual graph tasks
Demonstrates strong generalizability across datasets
Enhances node classification and link prediction results
Abstract
Textual graphs are ubiquitous in real-world applications, featuring rich text information with complex relationships, which enables advanced research across various fields. Textual graph representation learning aims to generate low-dimensional feature embeddings from textual graphs that can improve the performance of downstream tasks. A high-quality feature embedding should effectively capture both the structural and the textual information in a textual graph. However, most textual graph dataset benchmarks rely on word2vec techniques to generate feature embeddings, which inherently limits their capabilities. Recent works on textual graph representation learning can be categorized into two folds: supervised and unsupervised methods. Supervised methods finetune a language model on labeled nodes, which have limited capabilities when labeled data is scarce. Unsupervised methods, on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Semantic Web and Ontologies
MethodsAttentive Walk-Aggregating Graph Neural Network
