# Asymmetric Learning for Graph Neural Network based Link Prediction

**Authors:** Kai-Lang Yao, Wu-Jun Li

arXiv: 2303.00170 · 2023-03-02

## TL;DR

This paper introduces AML, a novel asymmetric learning approach for graph neural network-based link prediction, significantly improving training speed on large-scale graphs without sacrificing accuracy.

## Contribution

It proposes the first asymmetric learning strategy for GNN-LP, combining GNN and MLP models for node representations, and introduces a row-wise sampling method for efficient training.

## Key findings

- AML is 1.7X to 7.3X faster than symmetric GNN-LP methods.
- AML maintains comparable accuracy to existing methods.
- The approach effectively scales GNN-based link prediction to large graphs.

## Abstract

Link prediction is a fundamental problem in many graph based applications, such as protein-protein interaction prediction. Graph neural network (GNN) has recently been widely used for link prediction. However, existing GNN based link prediction (GNN-LP) methods suffer from scalability problem during training for large-scale graphs, which has received little attention by researchers. In this paper, we first give computation complexity analysis of existing GNN-LP methods, which reveals that the scalability problem stems from their symmetric learning strategy adopting the same class of GNN models to learn representation for both head and tail nodes. Then we propose a novel method, called asymmetric learning (AML), for GNN-LP. The main idea of AML is to adopt a GNN model for learning head node representation while using a multi-layer perceptron (MLP) model for learning tail node representation. Furthermore, AML proposes a row-wise sampling strategy to generate mini-batch for training, which is a necessary component to make the asymmetric learning strategy work for training speedup. To the best of our knowledge, AML is the first GNN-LP method adopting an asymmetric learning strategy for node representation learning. Experiments on three real large-scale datasets show that AML is 1.7X~7.3X faster in training than baselines with a symmetric learning strategy, while having almost no accuracy loss.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2303.00170/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/2303.00170/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/2303.00170/full.md

---
Source: https://tomesphere.com/paper/2303.00170