SKiM-GPT: combining biomedical literature-based discovery with large language model hypothesis evaluation
Jack Freeman, Robert J. Millikin, Leo Xu, Ishaan Sharma, Bethany Moore, Cannon Lock, Kevin Shine George, Aviral Bal, Chitrasen Mohanty, Ron Stewart

TL;DR
SKiM-GPT combines literature-based discovery with large language models to efficiently evaluate biomedical hypotheses using retrieved evidence.
Contribution
Introduces SKiM-GPT, a transparent RAG system that evaluates hypotheses using SKiM co-occurrence and LLMs with human-verifiable justifications.
Findings
SKiM-GPT achieves strong agreement with expert biologists on a benchmark of disease-gene-drug hypotheses (Cohen’s κ = 0.84).
The system retrieves relevant abstracts, filters them, and provides hypothesis scores with natural language justifications.
Abstract
Generating and testing hypotheses is a critical aspect of biomedical science. Typically, researchers generate hypotheses by carefully analyzing available information and making logical connections, which are then tested. The accelerating growth of biomedical literature makes it increasingly difficult to keep pace with connections between biological entities emerging across biomedical research. Recently developed automated means of generating hypotheses can generate many more hypotheses than can be easily tested. One such approach involves literature‑based discovery (LBD) systems such as Serial KinderMiner (SKiM), which surfaces putative A‑B‑C links derived from term co‑occurrence. However, LBD systems leave three critical gaps: (i) they find statistical associations, not biological relationships; (ii) they can produce false‑positive leads; and (iii) they do not assess agreement with a…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Meta-analysis and systematic reviews
