LLM-IE: A Python Package for Generative Information Extraction with Large Language Models

Enshuo Hsu; Kirk Roberts

arXiv:2411.11779 · cs.LG · April 2, 2025

TL;DR

LLM-IE is a Python package that simplifies biomedical information extraction using large language models, featuring an interactive agent for schema and prompt design, and demonstrating strong performance on benchmark datasets.

Contribution

The paper introduces LLM-IE, a novel Python toolkit that streamlines the development of biomedical information extraction pipelines with an interactive LLM agent for schema and prompt management.

Findings

01 Sentence-based prompting achieves the best performance, at the cost of longer inference time

02 System evaluation provides intuitive visualization

03 Package has been adopted in internal healthcare NLP projects

Abstract

Objectives: Despite the recent adoption of large language models (LLMs) for biomedical information extraction, challenges in prompt engineering and algorithms persist, with no dedicated software available. To address this, we developed LLM-IE: a Python package for building complete information extraction pipelines. Our key innovation is an interactive LLM agent to support schema definition and prompt design.

Materials and Methods: The LLM-IE supports named entity recognition, entity attribute extraction, and relation extraction tasks. We benchmarked on the i2b2 datasets and conducted a system evaluation.

Results: The sentence-based prompting algorithm resulted in the best performance while requiring a longer inference time. System evaluation provided intuitive visualization.

Discussion: LLM-IE was designed from practical NLP experience in healthcare and has been adopted in…
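The sentence-based prompting idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the LLM-IE API: every name below (`split_sentences`, `extract_by_sentence`, `fake_llm`, the prompt template) is hypothetical, the sentence splitter is a naive regular expression, and the LLM is replaced by a keyword stub so the example runs offline. It only shows the general pattern of prompting once per sentence and mapping each returned entity back to document-level character offsets.

```python
import re
from typing import Callable, Dict, List

# Hypothetical prompt template; the real package's prompts are not shown in the source.
PROMPT_TEMPLATE = "Extract medical entities from this sentence:\n{sentence}"


def split_sentences(text: str) -> List[Dict]:
    """Naive splitter: returns each sentence with its start offset in `text`."""
    sentences = []
    for match in re.finditer(r"[^.!?\n]+[.!?]?", text):
        raw = match.group()
        stripped = raw.strip()
        if stripped:
            # Shift the offset past any leading whitespace removed by strip().
            start = match.start() + (len(raw) - len(raw.lstrip()))
            sentences.append({"text": stripped, "start": start})
    return sentences


def extract_by_sentence(
    text: str,
    llm_extract: Callable[[str], List[str]],
    prompt_template: str = PROMPT_TEMPLATE,
) -> List[Dict]:
    """Prompt the extractor once per sentence and collect entity spans."""
    frames = []
    for sentence in split_sentences(text):
        prompt = prompt_template.format(sentence=sentence["text"])
        for entity in llm_extract(prompt):
            position = sentence["text"].find(entity)
            if position == -1:
                continue  # skip outputs that do not appear verbatim in the sentence
            start = sentence["start"] + position
            frames.append({"text": entity, "start": start, "end": start + len(entity)})
    return frames


def fake_llm(prompt: str) -> List[str]:
    """Stand-in for a real LLM call: a toy keyword lookup (assumption for the demo)."""
    vocabulary = ["chest pain", "aspirin"]
    return [term for term in vocabulary if term in prompt]


if __name__ == "__main__":
    note = "Patient denies chest pain. Started aspirin 81 mg daily."
    for frame in extract_by_sentence(note, fake_llm):
        print(frame)
```

Because the extractor is called once per sentence, the number of LLM calls grows with document length, which is presumably one reason for the longer inference time reported in the results.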

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling