TAGLAS: An atlas of text-attributed graph datasets in the era of large   graph and language models

Jiarui Feng; Hao Liu; Lecheng Kong; Mingfang Zhu; Yixin Chen; Muhan; Zhang

arXiv:2406.14683·cs.LG·October 22, 2024

TAGLAS: An atlas of text-attributed graph datasets in the era of large graph and language models

Jiarui Feng, Hao Liu, Lecheng Kong, Mingfang Zhu, Yixin Chen, Muhan, Zhang

PDF

Open Access 1 Repo

TL;DR

TAGLAS introduces a comprehensive, unified collection of text-attributed graph datasets and benchmarks, enabling standardized training and evaluation of graph-language models across diverse domains.

Contribution

It provides a unified format, easy loading, and evaluation tools for over 23 datasets, facilitating research in text-attributed graph modeling.

Findings

01

Unified dataset format enables cross-domain evaluation.

02

Tools for text-to-embedding and graph-to-text conversion.

03

Open-source platform encourages community contributions.

Abstract

In this report, we present TAGLAS, an atlas of text-attributed graph (TAG) datasets and benchmarks. TAGs are graphs with node and edge features represented in text, which have recently gained wide applicability in training graph-language or graph foundation models. In TAGLAS, we collect and integrate more than 23 TAG datasets with domains ranging from citation graphs to molecule graphs and tasks from node classification to graph question-answering. Unlike previous graph datasets and benchmarks, all datasets in TAGLAS have a unified node and edge text feature format, which allows a graph model to be simultaneously trained and evaluated on multiple datasets from various domains. Further, we provide a standardized, efficient, and simplified way to load all datasets and tasks. We also provide useful utils like text-to-embedding conversion, and graph-to-text conversion, which can facilitate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiaruifeng/taglas
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks