Continuous Knowledge Metabolism: Generating Scientific Hypotheses from Evolving Literature
Jinkai Tao, Yubo Wang, Xiaoyu Liu, Menglin Yang

TL;DR
This paper introduces Continuous Knowledge Metabolism (CKM), a framework for evolving literature analysis that improves scientific hypothesis generation by incrementally updating knowledge bases and analyzing knowledge change signals.
Contribution
The paper presents CKM and its variants, CKM-Lite and CKM-Full, demonstrating improved hypothesis prediction, efficiency, and insights into knowledge evolution in scientific literature.
Findings
Incremental processing outperforms batch processing in hypothesis prediction and efficiency.
Change-aware instrumentation increases perceived novelty but reduces coverage.
Knowledge convergence signals significantly enhance hypothesis hit rates.
Abstract
Scientific hypothesis generation requires tracking how knowledge evolves, not just what is currently known. We introduce Continuous Knowledge Metabolism (CKM), a framework that processes scientific literature through sliding time windows and incrementally updates a structured knowledge base as new findings arrive. We present CKM-Lite, an efficient variant that achieves strong predictive coverage through incremental accumulation, outperforming batch processing on hit rate (+2.8%, p=0.006), hypothesis yield (+3.6, p<0.001), and best-match alignment (+0.43, p<0.001) while reducing token cost by 92%. To understand what drives these differences, we develop CKM-Full, an instrumented variant that categorizes each new finding as novel, confirming, or contradicting, detects knowledge change signals, and conditions hypothesis generation on the full evolution trajectory. Analyzing 892 hypotheses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
