A phenotype-driven and evidence-governed framework for knowledge graph enrichment and hypotheses discovery in population data
Adela B\^ara, Simona-Vasilica Oprea

TL;DR
This paper introduces a novel framework combining GNNs, causal inference, probabilistic reasoning, and LLMs to enhance knowledge graph expansion and hypothesis discovery in population data, emphasizing interpretability and evidence-based validation.
Contribution
It presents a phenotype-driven, evidence-governed approach that balances discovery and confirmation, outperforming existing methods in generating relevant, novel, and validated knowledge claims.
Findings
Produces more interpretable phenotypes
Reveals context-dependent causal structures
Achieves high-quality, validated claims with better trade-offs
Abstract
Current knowledge graph (KG) construction methods are confirmatory, focusing on recovering known relationships rather than identifying novel or context-dependent nodes. This paper proposes a phenotype-driven and evidence-governed framework that shifts the paradigm toward structured hypothesis discovery and controlled KG expansion. The approach integrates graph neural networks (GNNs) for phenotype discovery, causal inference, probabilistic reasoning and large language models (LLMs) for hypothesis generation and claim extraction within a unified pipeline. The framework prioritizes relationships that are both structurally supported by data and underexplored in the literature. KG expansion is formulated as a multi-objective optimization problem, where candidate claims are jointly evaluated in terms of relevance, structural validation and novelty. Pareto-optimal selection enables the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
