CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for   Literature Review Support

Chao-Chun Hsu; Erin Bransom; Jenna Sparks; Bailey Kuehl; Chenhao Tan,; David Wadden; Lucy Lu Wang; Aakanksha Naik

arXiv:2407.16148·cs.CL·July 24, 2024

CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support

Chao-Chun Hsu, Erin Bransom, Jenna Sparks, Bailey Kuehl, Chenhao Tan,, David Wadden, Lucy Lu Wang, Aakanksha Naik

PDF

3 Datasets 1 Video

TL;DR

This paper explores using large language models to create hierarchical structures of scientific literature for aiding reviews, introduces a new dataset called CHIME, and demonstrates improvements with human-in-the-loop corrections.

Contribution

It presents a novel LLM-based pipeline for hierarchical organization of scientific studies, introduces the CHIME dataset, and develops a human feedback-based corrector model to enhance study assignment accuracy.

Findings

01

LLMs can generate promising hierarchies but need improvements in study assignment.

02

Expert corrections help quantify and improve LLM performance.

03

The corrector model improves study assignment F1 score by 12.6 points.

Abstract

Literature review requires researchers to synthesize a large amount of information and is increasingly challenging as the scientific literature expands. In this work, we investigate the potential of LLMs for producing hierarchical organizations of scientific studies to assist researchers with literature review. We define hierarchical organizations as tree structures where nodes refer to topical categories and every node is linked to the studies assigned to that category. Our naive LLM-based pipeline for hierarchy generation from a set of studies produces promising yet imperfect hierarchies, motivating us to collect CHIME, an expert-curated dataset for this task focused on biomedicine. Given the challenging and time-consuming nature of building hierarchies from scratch, we use a human-in-the-loop process in which experts correct errors (both links between categories and study assignment)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support· underline

Taxonomy

MethodsSparse Evolutionary Training