HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance
Rosni Vasu, Chandrayee Basu, Bhavana Dalvi Mishra, Cristina Sarasua, Peter Clark, Abraham Bernstein

TL;DR
HypER is a specialized small language model designed for literature-grounded hypothesis generation, emphasizing reasoning and evidence provenance, outperforming baseline models in validity discrimination and hypothesis quality.
Contribution
It introduces HypER, a multi-task trained model that improves scientific hypothesis generation by incorporating reasoning and evidence provenance, surpassing existing methods.
Findings
HypER outperforms base models in reasoning accuracy (+22% F1).
Generates more evidence-grounded hypotheses with higher quality scores.
Achieves high feasibility and impact ratings from human experts.
Abstract
Large Language models have demonstrated promising performance in research ideation across scientific domains. Hypothesis development, the process of generating a highly specific declarative statement connecting a research idea with empirical validation, has received relatively less attention. Existing approaches trivially deploy retrieval augmentation and focus only on the quality of the final output ignoring the underlying reasoning process behind ideation. We present (othesis Generation with xplanation and easoning), a small language model (SLM) trained for literature-guided reasoning and evidence-based hypothesis generation. is trained in a multi-task setting to discriminate between valid and invalid scientific reasoning chains in presence of controlled distractions. We find that outperformes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsFocus · Balanced Selection
