Toward Reliable Ad-hoc Scientific Information Extraction: A Case Study on Two Materials Datasets

Satanu Ghosh; Neal R. Brodnik; Carolina Frey; Collin Holgate; Tresa M. Pollock; Samantha Daly; Samuel Carton

arXiv:2406.05348 · cs.CL · November 18, 2025 · 1 citation

TL;DR

This paper investigates GPT-4's capability to perform ad-hoc, schema-based information extraction from scientific literature in materials science, analyzing its accuracy and limitations through expert error analysis.

Contribution

It introduces a case study evaluating GPT-4's effectiveness in replicating existing datasets and provides insights for improving automated scientific information extraction.

Findings

01. GPT-4 can partially replicate datasets with basic prompting
02. Manual error analysis reveals specific extraction challenges
03. Research directions proposed for enhancing extraction fidelity

Abstract

We explore the ability of GPT-4 to perform ad-hoc, schema-based information extraction from scientific literature. Specifically, we assess whether it can, with a basic prompting approach, replicate two existing materials science datasets, given the manuscripts from which they were originally manually extracted. We employ materials scientists to perform a detailed manual error analysis to assess where the model struggles to faithfully extract the desired information, and draw on their insights to suggest research directions to address this broadly important task.
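
To make the replication setup in the abstract concrete, the following is a minimal sketch of how extracted records could be compared against a manually curated dataset under a fixed schema. It is not the authors' code or prompt: the schema fields, the prompt wording, and the helper names `build_prompt`, `normalize`, and `score` are all assumptions made for illustration, and exact-match scoring is a much cruder measure than the expert error analysis the paper describes.

```python
# Hypothetical schema: these field names are illustrative, not taken from the paper.
SCHEMA = ("material", "property", "value", "unit")


def build_prompt(schema, text):
    """Assemble a basic extraction prompt (assumed wording, not the paper's prompt)."""
    fields = ", ".join(schema)
    return (
        "Extract every record from the text below as a JSON list of objects "
        f"with exactly these keys: {fields}. Use null for missing values.\n\n"
        f"Text:\n{text}"
    )


def normalize(record, schema):
    """Reduce a record to a comparable tuple: schema order, trimmed, lowercased strings."""
    return tuple(str(record.get(field, "")).strip().lower() for field in schema)


def score(extracted, gold, schema):
    """Record-level precision and recall under exact match on all schema fields."""
    ext = {normalize(r, schema) for r in extracted}
    ref = {normalize(r, schema) for r in gold}
    hits = len(ext & ref)
    precision = hits / len(ext) if ext else 0.0
    recall = hits / len(ref) if ref else 0.0
    return precision, recall
```

In this sketch, `gold` would hold the rows of the existing dataset and `extracted` the parsed model output for the same manuscripts. A record counts as correct only if every schema field matches after normalization, so a single wrong value or unit makes the whole record a miss; a field-level or expert-judged comparison would be more forgiving.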

Code & Models

Repositories

satanug/ad_hoc_information_extraction (Official)

Taxonomy

Topics: Data Quality and Management · Web Data Mining and Analysis

Methods: Attention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer