Higher-Order Knowledge Representations for Agentic Scientific Reasoning
Isabella A. Stewart, Markus J. Buehler

TL;DR
This paper introduces a hypergraph-based knowledge representation framework for scientific reasoning, enabling agentic systems to generate grounded hypotheses by capturing complex multi-entity relationships in large scientific corpora.
Contribution
It presents a novel hypergraph construction methodology that encodes higher-order interactions, improving upon traditional knowledge graphs for scientific reasoning tasks.
Findings
Constructed a hypergraph with over 160,000 nodes from scientific literature.
Revealed a scale-free topology with highly connected conceptual hubs.
Enabled hypothesis generation linking distant scientific concepts.
Abstract
Scientific inquiry requires systems-level reasoning that integrates heterogeneous experimental data, cross-domain knowledge, and mechanistic evidence into coherent explanations. While Large Language Models (LLMs) offer inferential capabilities, they often depend on retrieval-augmented contexts that lack structural depth. Traditional Knowledge Graphs (KGs) attempt to bridge this gap, yet their pairwise constraints fail to capture the irreducible higher-order interactions that govern emergent physical behavior. To address this, we introduce a methodology for constructing hypergraph-based knowledge representations that faithfully encode multi-entity relationships. Applied to a corpus of ~1,100 manuscripts on biocomposite scaffolds, our framework constructs a global hypergraph of 161,172 nodes and 320,201 hyperedges, revealing a scale-free topology (power law exponent ~1.23) organized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Topic Modeling
