Do Neural Scaling Laws Exist on Graph Self-Supervised Learning?

Qian Ma; Haitao Mao; Jingzhe Liu; Zhehua Zhang; Chunlin Feng; Yu Song,; Yihan Shao; Yao Ma

arXiv:2408.11243·cs.LG·August 28, 2024

Do Neural Scaling Laws Exist on Graph Self-Supervised Learning?

Qian Ma, Haitao Mao, Jingzhe Liu, Zhehua Zhang, Chunlin Feng, Yu Song,, Yihan Shao, Yao Ma

PDF

Open Access

TL;DR

This paper investigates whether current graph self-supervised learning methods follow neural scaling laws, finding that performance fluctuations are more influenced by architecture and task design than scale, challenging assumptions for building Graph Foundation Models.

Contribution

The study provides a comprehensive benchmark and analysis showing existing graph SSL techniques do not follow neural scaling laws, highlighting the importance of architecture and task design over scale.

Findings

01

Graph SSL performance does not consistently improve with scale.

02

Model architecture and pretext task choice are key performance factors.

03

Current SSL methods show performance fluctuations rather than scaling behavior.

Abstract

Self-supervised learning~(SSL) is essential to obtain foundation models in NLP and CV domains via effectively leveraging knowledge in large-scale unlabeled data. The reason for its success is that a suitable SSL design can help the model to follow the neural scaling law, i.e., the performance consistently improves with increasing model and dataset sizes. However, it remains a mystery whether existing SSL in the graph domain can follow the scaling behavior toward building Graph Foundation Models~(GFMs) with large-scale pre-training. In this study, we examine whether existing graph SSL techniques can follow the neural scaling behavior with the potential to serve as the essential component for GFMs. Our benchmark includes comprehensive SSL technique implementations with analysis conducted on both the conventional SSL setting and many new settings adopted in other domains. Surprisingly,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications