LINK-KG: LLM-Driven Coreference-Resolved Knowledge Graphs for Human Smuggling Networks
Dipak Meher, Carlotta Domeniconi, Guadalupe Correa-Cabrera

TL;DR
LINK-KG is a modular framework that leverages large language models for coreference resolution to construct cleaner, more coherent knowledge graphs from complex legal texts about human smuggling networks.
Contribution
It introduces a novel LLM-guided coreference resolution pipeline with a type-specific Prompt Cache for scalable, accurate KG extraction from long, unstructured legal documents.
Findings
Reduced node duplication by 45.21%.
Decreased noisy nodes by 32.22%.
Produced cleaner, more coherent knowledge graphs.
Abstract
Human smuggling networks are complex and constantly evolving, making them difficult to analyze comprehensively. Legal case documents offer rich factual and procedural insights into these networks but are often long, unstructured, and filled with ambiguous or shifting references, posing significant challenges for automated knowledge graph (KG) construction. Existing methods either overlook coreference resolution or fail to scale beyond short text spans, leading to fragmented graphs and inconsistent entity linking. We propose LINK-KG, a modular framework that integrates a three-stage, LLM-guided coreference resolution pipeline with downstream KG extraction. At the core of our approach is a type-specific Prompt Cache, which consistently tracks and resolves references across document chunks, enabling clean and disambiguated narratives for structured knowledge graph construction from both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
