GRID: Graph Representation of Intelligence Data for Security Text Knowledge Graph Construction

Liangyi Huang; Zichen Liu; Fei Shao; Shang Ma; Mengshi Zhang; Zihao Chen; Yanfang Ye; and Xusheng Xiao

arXiv:2605.16714·cs.AI·May 19, 2026

GRID: Graph Representation of Intelligence Data for Security Text Knowledge Graph Construction

Liangyi Huang, Zichen Liu, Fei Shao, Shang Ma, Mengshi Zhang, Zihao Chen, Yanfang Ye, and Xusheng Xiao

PDF

TL;DR

The paper introduces GRID, an end-to-end framework for constructing security knowledge graphs from cyber threat intelligence texts, leveraging a novel supervision pipeline and reward models to improve extraction accuracy and stability.

Contribution

GRID presents a new supervision pipeline and reward modeling approach for security text knowledge graph construction, achieving high precision and recall with lower costs.

Findings

01

The primary reward model achieves 84.62% precision and 64.91% recall on CTI articles.

02

The framework outperforms previous methods in source-averaged recall and F1 score.

03

Task-bank rewards can be reused across runs, reducing costs and improving stability.

Abstract

Security knowledge graphs can provide computable external memory for security agents, but constructing them from long-form cyber threat intelligence (CTI) remains difficult: LLMs often lack grounded security-domain knowledge, and end-to-end document-to-graph training is hard to supervise with cheap, stable rewards. We present GRID (Graph Representation of Intelligence Data), an end-to-end framework for security text knowledge graph construction. GRID first builds security-domain supervision from CTI articles by creating traceable article-graph alignments through graph extraction and knowledge-graph-conditioned text revision. It then turns document-to-graph learning into a scripted task bank combining four-option multi-select questions with triple-level regex matching targets, yielding more stable task-specific rewards than repeatedly scoring full graph outputs with an LLM judge. Using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.