GRID: Graph Representation of Intelligence Data for Security Text Knowledge Graph Construction
Liangyi Huang, Zichen Liu, Fei Shao, Shang Ma, Mengshi Zhang, Zihao Chen, Yanfang Ye, and Xusheng Xiao

TL;DR
The paper introduces GRID, an end-to-end framework for constructing security knowledge graphs from cyber threat intelligence texts, leveraging a novel supervision pipeline and reward models to improve extraction accuracy and stability.
Contribution
GRID presents a new supervision pipeline and reward modeling approach for security text knowledge graph construction, achieving high precision and recall with lower costs.
Findings
The primary reward model achieves 84.62% precision and 64.91% recall on CTI articles.
The framework outperforms previous methods in source-averaged recall and F1 score.
Task-bank rewards can be reused across runs, reducing costs and improving stability.
Abstract
Security knowledge graphs can provide computable external memory for security agents, but constructing them from long-form cyber threat intelligence (CTI) remains difficult: LLMs often lack grounded security-domain knowledge, and end-to-end document-to-graph training is hard to supervise with cheap, stable rewards. We present GRID (Graph Representation of Intelligence Data), an end-to-end framework for security text knowledge graph construction. GRID first builds security-domain supervision from CTI articles by creating traceable article-graph alignments through graph extraction and knowledge-graph-conditioned text revision. It then turns document-to-graph learning into a scripted task bank combining four-option multi-select questions with triple-level regex matching targets, yielding more stable task-specific rewards than repeatedly scoring full graph outputs with an LLM judge. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
