SKiM-GPT: combining biomedical literature-based discovery with large language model hypothesis evaluation

Jack Freeman; Robert J. Millikin; Leo Xu; Ishaan Sharma; Bethany Moore; Cannon Lock; Kevin Shine George; Aviral Bal; Chitrasen Mohanty; Ron Stewart

PMC · DOI:10.1186/s12859-025-06350-7·December 17, 2025

SKiM-GPT: combining biomedical literature-based discovery with large language model hypothesis evaluation

Jack Freeman, Robert J. Millikin, Leo Xu, Ishaan Sharma, Bethany Moore, Cannon Lock, Kevin Shine George, Aviral Bal, Chitrasen Mohanty, Ron Stewart

PDF

Open Access

TL;DR

SKiM-GPT combines literature-based discovery with large language models to efficiently evaluate biomedical hypotheses using retrieved evidence.

Contribution

Introduces SKiM-GPT, a transparent RAG system that evaluates hypotheses using SKiM co-occurrence and LLMs with human-verifiable justifications.

Findings

01

SKiM-GPT achieves strong agreement with expert biologists on a benchmark of disease-gene-drug hypotheses (Cohen’s κ = 0.84).

02

The system retrieves relevant abstracts, filters them, and provides hypothesis scores with natural language justifications.

Abstract

Generating and testing hypotheses is a critical aspect of biomedical science. Typically, researchers generate hypotheses by carefully analyzing available information and making logical connections, which are then tested. The accelerating growth of biomedical literature makes it increasingly difficult to keep pace with connections between biological entities emerging across biomedical research. Recently developed automated means of generating hypotheses can generate many more hypotheses than can be easily tested. One such approach involves literature‑based discovery (LBD) systems such as Serial KinderMiner (SKiM), which surfaces putative A‑B‑C links derived from term co‑occurrence. However, LBD systems leave three critical gaps: (i) they find statistical associations, not biological relationships; (ii) they can produce false‑positive leads; and (iii) they do not assess agreement with a…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Genes4

AHR EGFR CCK SLN

Proteins4

Species2

Liphistius sp. LM(species)Homo sapiens(human · species)

Cell lines1

-GPT— Homo sapiens (Human) · Chronic myelogenous leukemia, BCR-ABL1 positive · Cancer cell line

Chemicals4

GPT-4 gant61 Erlotinib fish oil

Diseases11

cancers hallucinations Raynaud's disease pancreatic cancer LLMs Breast Breast Cancer node diabetes Alzheimer's disease LBD

Figures5

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Meta-analysis and systematic reviews