What Makes AI Research Replicable? Executable Knowledge Graphs as Scientific Knowledge Representations

Yujie Luo; Zhuoyun Yu; Xuehai Wang; Yuqi Zhu; Ningyu Zhang; Lanning Wei; Lun Du; Da Zheng; Huajun Chen

arXiv:2510.17795·cs.CL·April 21, 2026

What Makes AI Research Replicable? Executable Knowledge Graphs as Scientific Knowledge Representations

Yujie Luo, Zhuoyun Yu, Xuehai Wang, Yuqi Zhu, Ningyu Zhang, Lanning Wei, Lun Du, Da Zheng, Huajun Chen

PDF

1 Repo

TL;DR

This paper introduces Executable Knowledge Graphs (xKG), a structured, paper-centric knowledge base that enhances AI research replication by integrating code snippets and technical insights, significantly improving performance across multiple frameworks.

Contribution

The paper presents xKG, a novel, extensible knowledge representation that captures technical details and code snippets from scientific literature to improve AI research replication.

Findings

01

xKG improves replication performance by 10.9% on PaperBench.

02

Integration of xKG into LLM agents enhances their ability to reproduce AI research.

03

xKG effectively captures technical details and code snippets from scientific papers.

Abstract

Replicating AI research is a crucial yet challenging task for large language model (LLM) agents. Existing approaches often struggle to generate executable code, primarily due to insufficient background knowledge and the limitations of retrieval-augmented generation (RAG) methods, which fail to capture latent technical details hidden in referenced papers. Furthermore, previous approaches tend to overlook valuable implementation-level code signals and lack structured knowledge representations that support multi-granular retrieval and reuse. To overcome these challenges, we propose Executable Knowledge Graphs (xKG), a pluggable, paper-centric knowledge base that automatically integrates code snippets and technical insights extracted from scientific literature. When integrated into three agent frameworks with two different LLMs, xKG shows substantial performance gains (10.9% with o3-mini)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zjunlp/xKG
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.