FOS: A Large-Scale Temporal Graph Benchmark for Scientific Interdisciplinary Link Prediction
Kiyan Rezaee, Morteza Ziabakhsh, Niloofar Nikfarjam, Mohammad M. Ghassemi, Yazdan Rezaee Jouryabi, Sadegh Eskandari, Reza Lashgari

TL;DR
FOS is a large-scale, time-aware graph benchmark for predicting novel interdisciplinary research field linkages, utilizing semantic embeddings and temporal data to advance scientific frontier forecasting.
Contribution
Introduces FOS, a comprehensive temporal graph dataset with semantic and topological features for predicting new scientific field connections, and evaluates state-of-the-art models on this benchmark.
Findings
Embedding textual descriptions improves prediction accuracy.
Different models perform best under different evaluation regimes.
Predicted links often align with subsequent real-world interdisciplinary developments.
Abstract
Interdisciplinary scientific breakthroughs mostly emerge unexpectedly, and forecasting the formation of novel research fields remains a major challenge. We introduce FOS (Future Of Science), a comprehensive time-aware graph-based benchmark that reconstructs annual co-occurrence graphs of 65,027 research sub-fields (spanning 19 general domains) over the period 1827-2024. In these graphs, edges denote the co-occurrence of two fields in a single publication and are timestamped with the corresponding publication year. Nodes are enriched with semantic embeddings, and edges are characterized by temporal and topological descriptors. We formulate the prediction of new field-pair linkages as a temporal link-prediction task, emphasizing the "first-time" connections that signify pioneering interdisciplinary directions. Through extensive experiments, we evaluate a suite of state-of-the-art temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Bioinformatics and Genomic Networks
