Frugal Knowledge Graph Construction with Local LLMs: A Zero-Shot Pipeline, Self-Consistency and Wisdom of Artificial Crowds
Pierre Jourlin (LIA)

TL;DR
This paper empirically evaluates a zero-shot knowledge graph construction pipeline using local LLMs on consumer hardware, achieving competitive results and analyzing diversity and consensus effects.
Contribution
It introduces a reproducible, zero-shot pipeline for knowledge graph tasks with detailed evaluation and insights into diversity mechanisms and model consensus effects.
Findings
Achieves 0.70 F1 on 500 document relations in zero-shot setting.
Self-consistency improves multi-hop reasoning EM by up to 23%.
The full pipeline reaches an EM of 0.55 with confidence-routing, running efficiently on a single GPU.
Abstract
This paper presents an empirical study of a multi-model zero-shot pipeline for knowledge graph construction and exploitation, executed entirely through local inference on consumer-grade hardware. We propose a reproducible evaluation framework integrating two external benchmarks (DocRED, HotpotQA), WebQuestionsSP-style synthetic data, and the RAGAS evaluation framework in an automated pipeline. On 500 document-level relations, our system achieves an F1 of 0.70 0.041 in zero-shot, compared to 0.80 for supervised DREEAM. Text-to-query achieves an accuracy of 0.80 0.06 on 200 samples. Multi-hop reasoning achieves an Exact Match (EM) of 0.460.04 on 500 HotpotQA questions, with a RAGAS faithfulness of 0.96 0.04 on 50 samples. Beyond the pipeline, we study diversity mechanisms for difficult multi-hop reasoning. On 181 questions unsolvable at zero temperature,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
