LLM-IE: a python package for biomedical generative information extraction with large language models

Enshuo Hsu; Kirk Roberts

doi:10.1093/jamiaopen/ooaf012

ScienceGate Book Chapters

JOURNAL ARTICLE

LLM-IE: a python package for biomedical generative information extraction with large language models

Enshuo Hsu Kirk Roberts

Year: 2025 Journal: JAMIA Open Vol: 8 (2)Pages: ooaf012-ooaf012 Publisher: University of Oxford

DOI: 10.1093/jamiaopen/ooaf012

Get Full-Text PDF Get Analytical Report

Abstract

Abstract Objectives Despite the recent adoption of large language models (LLMs) for biomedical information extraction (IE), challenges in prompt engineering and algorithms persist, with no dedicated software available. To address this, we developed LLM-IE: a Python package for building complete IE pipelines. Materials and Methods The LLM-IE supports named entity recognition, entity attribute extraction, and relation extraction tasks. We benchmarked it on the i2b2 clinical datasets. Results The sentence-based prompting algorithm resulted in the best 8-shot performance of over 70% strict F1 for entity extraction and about 60% F1 for entity attribute extraction. Discussion We developed a Python package, LLM-IE, highlighting (1) an interactive LLM agent to support schema definition and prompt design, (2) state-of-the-art prompting algorithms, and (3) visualization features. Conclusion The LLM-IE provides essential building blocks for developing robust information extraction pipelines. Future work will aim to expand its features and further optimize computational efficiency.

Keywords:

Python (programming language) Computer science Information extraction Relationship extraction Schema (genetic algorithms) Visualization Artificial intelligence Sentence Classifier (UML) Natural language processing Machine learning Data mining Information retrieval Programming language

Metrics

Cited By

43.38

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Biomedical Text Mining and Ontologies

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

LLM-IE: a python package for biomedical generative information extraction with large language models

Abstract

Metrics

Citation History

Topics

Related Documents

Large language models for generative information extraction: a survey

Towards Instruction-Tuned Verification for Improving Biomedical Information Extraction with Large Language Models

Generative AI: foundational models. Natural Language Processing (NLP) and LARGE Language Models (LLM)

LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

LLM – Large Language Models