Context-Aware Scientific Knowledge Extraction on Linked Open Data using Large Language Models

Sajratul Y. Rubaiat; Hasan M. Jamil

arXiv:2506.17580·cs.IR·June 24, 2025

Context-Aware Scientific Knowledge Extraction on Linked Open Data using Large Language Models

Sajratul Y. Rubaiat, Hasan M. Jamil

PDF

TL;DR

This paper presents WISE, a structured, LLM-powered system that efficiently extracts, refines, and synthesizes scientific knowledge from diverse sources, significantly improving recall and depth over traditional methods.

Contribution

Introduces WISE, a novel workflow leveraging large language models and a tree-based architecture for context-aware, non-redundant scientific knowledge extraction from multiple sources.

Findings

01

WISE reduces processed text by over 80% while increasing recall.

02

Outputs are more unique and in-depth compared to baselines.

03

System is adaptable to various scientific domains.

Abstract

The exponential growth of scientific literature challenges researchers extracting and synthesizing knowledge. Traditional search engines return many sources without direct, detailed answers, while general-purpose LLMs may offer concise responses that lack depth or omit current information. LLMs with search capabilities are also limited by context window, yielding short, incomplete answers. This paper introduces WISE (Workflow for Intelligent Scientific Knowledge Extraction), a system addressing these limits by using a structured workflow to extract, refine, and rank query-specific knowledge. WISE uses an LLM-powered, tree-based architecture to refine data, focusing on query-aligned, context-aware, and non-redundant information. Dynamic scoring and ranking prioritize unique contributions from each source, and adaptive stopping criteria minimize processing overhead. WISE delivers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.