Context-Aware Scientific Knowledge Extraction on Linked Open Data using Large Language Models
Sajratul Y. Rubaiat, Hasan M. Jamil

TL;DR
This paper presents WISE, a structured, LLM-powered system that efficiently extracts, refines, and synthesizes scientific knowledge from diverse sources, significantly improving recall and depth over traditional methods.
Contribution
Introduces WISE, a novel workflow leveraging large language models and a tree-based architecture for context-aware, non-redundant scientific knowledge extraction from multiple sources.
Findings
WISE reduces processed text by over 80% while increasing recall.
Outputs are more unique and in-depth compared to baselines.
System is adaptable to various scientific domains.
Abstract
The exponential growth of scientific literature challenges researchers extracting and synthesizing knowledge. Traditional search engines return many sources without direct, detailed answers, while general-purpose LLMs may offer concise responses that lack depth or omit current information. LLMs with search capabilities are also limited by context window, yielding short, incomplete answers. This paper introduces WISE (Workflow for Intelligent Scientific Knowledge Extraction), a system addressing these limits by using a structured workflow to extract, refine, and rank query-specific knowledge. WISE uses an LLM-powered, tree-based architecture to refine data, focusing on query-aligned, context-aware, and non-redundant information. Dynamic scoring and ranking prioritize unique contributions from each source, and adaptive stopping criteria minimize processing overhead. WISE delivers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
