Link Prediction on Textual Edge Graphs
Chen Ling, Zhuofeng Li, Yuntong Hu, Zheng Zhang, Zhongyuan Liu, Shuang, Zheng, Jian Pei, Liang Zhao

TL;DR
This paper introduces Link2Doc, a novel framework for link prediction on Textual-edge Graphs that combines neighborhood summarization as documents with self-supervised learning to improve semantic and topological understanding.
Contribution
The paper proposes a new approach that effectively integrates graph topology and edge semantics for link prediction on TEGs, outperforming existing methods.
Findings
Link2Doc achieves superior accuracy on real-world datasets.
The framework effectively combines GNNs and language models.
Ablation studies confirm the importance of neighborhood summarization.
Abstract
Textual-edge Graphs (TEGs), characterized by rich text annotations on edges, are increasingly significant in network science due to their ability to capture rich contextual information among entities. Existing works have proposed various edge-aware graph neural networks (GNNs) or let language models directly make predictions. However, they often fall short of fully capturing the contextualized semantics on edges and graph topology, respectively. This inadequacy is particularly evident in link prediction tasks that require a comprehensive understanding of graph topology and semantics between nodes. In this paper, we present a novel framework - Link2Doc, designed especially for link prediction on textual-edge graphs. Specifically, we propose to summarize neighborhood information between node pairs as a human-written document to preserve both semantic and topology information. A…
Peer Reviews
Decision·Submitted to ICLR 2025
1. The question of LP on TEGs is practical. 2. The idea of using (s,t)-transition graph for link prediction is interesting and makes sense to me. 3. Dstillinig knowledge from teacher LLM to student GNN is a well-established approach with proven effectiveness, which suits the setting of LP on TEGs.
1. I found it difficult to capture the key aspects of the proposed method. The authors may consider reorganizing the manuscript to put more emphasis on the overall pipeline and refer readers to appendix for details. The current version contains a lot of details and seems overwhelming to me. 2. Confusing experimental settings and missing baselines. Please see Questions.
1. The analysis of challenges in LLM-based and GNN-based methods are interesting, supported by real examples in Fig 1. 2. The proposed transition graph provides a clear view of target node pairs, avoiding information over-smoothing when propagating on whole graph.
1. Bad representation. The challenges of LLM-based and GNN-based methods are twisted together. Not clearly conveyed in texts and supported by experiments. 2. LM-enhanced GNNs use BERT to extract text representations, while other models use LLaMA-3 and GPT-4o, making comparison unfair. Besides, even though LM-enhanced GNNs use BERT, these models still perform the second best generally, making the improvements of Link2Doc doubtable. 3. The GNN layers are set to 2. The review suspect that, will the
This paper presents an innovative approach to address the shortcomings of existing GNN-based and LLM-based methods in link prediction. The authors effectively highlight the limitations of current methodologies and introduce a novel framework that integrates both graph structure and textual information.
One notable limitation is that the paper’s Transition Graph Document Construction approach appears to be similar with the Graph2Text component from the TAGA framework, as introduced in TAGA: Text-Attributed Graph Self-Supervised Learning by Synergizing Graph and Text Mutual Transformations https://arxiv.org/abs/2405.16800. If that is your work, please cite this paper properly, and discuss about the difference between your method with perspective of the algorithm development and complexity?
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Graph Neural Networks · Advanced Text Analysis Techniques
