OGB-LSC: A Large-Scale Challenge for Machine Learning on Graphs
Weihua Hu, Matthias Fey, Hongyu Ren, Maho Nakata, Yuxiao Dong, Jure, Leskovec

TL;DR
The paper introduces OGB-LSC, a set of large-scale graph datasets for advancing machine learning on graphs, accompanied by baseline models, and reports on its success in a major competition to foster innovation.
Contribution
It provides the first large-scale, real-world graph datasets for ML, along with baseline experiments and insights from a global competition to accelerate research.
Findings
Expressive models outperform simple baselines on large datasets.
Participation in KDD Cup led to significant performance improvements.
The datasets and benchmarks are now publicly available for research.
Abstract
Enabling effective and efficient machine learning (ML) over large-scale graph data (e.g., graphs with billions of edges) can have a great impact on both industrial and scientific applications. However, existing efforts to advance large-scale graph ML have been largely limited by the lack of a suitable public benchmark. Here we present OGB Large-Scale Challenge (OGB-LSC), a collection of three real-world datasets for facilitating the advancements in large-scale graph ML. The OGB-LSC datasets are orders of magnitude larger than existing ones, covering three core graph learning tasks -- link prediction, graph regression, and node classification. Furthermore, we provide dedicated baseline experiments, scaling up expressive graph ML models to the massive datasets. We show that expressive models significantly outperform simple scalable baselines, indicating an opportunity for dedicated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Machine Learning in Materials Science
