Procedural Text Mining with Large Language Models
Anisa Rula, Jennifer D'Souza

TL;DR
This paper explores using GPT-4 and in-context learning techniques to extract procedural information from unstructured PDF texts, demonstrating promising results and addressing data scarcity challenges in NLP procedure extraction.
Contribution
It introduces a novel application of large language models with in-context learning for procedural text mining from PDFs, highlighting customizations that improve extraction without extensive training data.
Findings
In-context learning enhances procedure extraction accuracy.
Ontology-based definitions improve model understanding.
Limited samples suffice for effective zero-shot learning.
Abstract
Recent advancements in the field of Natural Language Processing, particularly the development of large-scale language models that are pretrained on vast amounts of knowledge, are creating novel opportunities within the realm of Knowledge Engineering. In this paper, we investigate the usage of large language models (LLMs) in both zero-shot and in-context learning settings to tackle the problem of extracting procedures from unstructured PDF text in an incremental question-answering fashion. In particular, we leverage the current state-of-the-art GPT-4 (Generative Pre-trained Transformer 4) model, accompanied by two variations of in-context learning that involve an ontology with definitions of procedures and steps and a limited number of samples of few-shot learning. The findings highlight both the promise of this approach and the value of the in-context learning customisations. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
MethodsMulti-Head Attention · Attention Is All You Need · Dropout · Dense Connections · Linear Layer · Label Smoothing · Adam · Absolute Position Encodings · Residual Connection · Layer Normalization
