Large language models for generative information extraction: a survey

Derong Xu; Wei Chen; Wenjun Peng; Chao Zhang; Tong Xu; Xiangyu Zhao; Xian Wu; Yefeng Zheng; Yan Wang; Enhong Chen

doi:10.1007/s11704-024-40555-y

JOURNAL ARTICLE

Year: 2024; Journal: Frontiers of Computer Science; Vol: 18 (6); Publisher: Higher Education Press

Abstract

Information Extraction (IE) aims to extract structural knowledge from plain natural language texts. Recently, generative Large Language Models (LLMs) have demonstrated remarkable capabilities in text understanding and generation. As a result, numerous works have been proposed to integrate LLMs into IE tasks based on a generative paradigm. To provide a comprehensive and systematic review of LLM efforts for IE tasks, this study surveys the most recent advancements in the field. We first present an extensive overview that categorizes these works by IE subtask and technique. We then empirically analyze the most advanced methods and identify emerging trends in IE with LLMs. Based on this review, we identify several technical insights and promising research directions that deserve further exploration. We maintain a public repository and regularly update related works and resources on GitHub (LLM4IE repository).

Keywords:

Computer science; Generative grammar; Information extraction; Natural language processing; Extraction (chemistry); Artificial intelligence; Generative model; Information retrieval

Metrics

Cited By: 144

FWCI (Field Weighted Citation Impact): 91.98

Refs: 152

Citation Normalized Percentile: 1.00

Is in top 1%

Is in top 10%

Topics

Topic Modeling: Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques: Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques: Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Generative Large Language Models

LLM-IE: a python package for biomedical generative information extraction with large language models

Medication information extraction using local large language models

Extraction of Subjective Information from Large Language Models

ADELIE: Aligning Large Language Models on Information Extraction