CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support
Chao-Chun Hsu, Erin Bransom, Jenna Sparks, Bailey Kuehl, Chenhao Tan,, David Wadden, Lucy Lu Wang, Aakanksha Naik

TL;DR
This paper explores using large language models to create hierarchical structures of scientific literature for aiding reviews, introduces a new dataset called CHIME, and demonstrates improvements with human-in-the-loop corrections.
Contribution
It presents a novel LLM-based pipeline for hierarchical organization of scientific studies, introduces the CHIME dataset, and develops a human feedback-based corrector model to enhance study assignment accuracy.
Findings
LLMs can generate promising hierarchies but need improvements in study assignment.
Expert corrections help quantify and improve LLM performance.
The corrector model improves study assignment F1 score by 12.6 points.
Abstract
Literature review requires researchers to synthesize a large amount of information and is increasingly challenging as the scientific literature expands. In this work, we investigate the potential of LLMs for producing hierarchical organizations of scientific studies to assist researchers with literature review. We define hierarchical organizations as tree structures where nodes refer to topical categories and every node is linked to the studies assigned to that category. Our naive LLM-based pipeline for hierarchy generation from a set of studies produces promising yet imperfect hierarchies, motivating us to collect CHIME, an expert-curated dataset for this task focused on biomedicine. Given the challenging and time-consuming nature of building hierarchies from scratch, we use a human-in-the-loop process in which experts correct errors (both links between categories and study assignment)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsSparse Evolutionary Training
