JOURNAL ARTICLE

Enhancing Multi-Document Summarization with Cross-Document Graph-based Information Extraction

Abstract

Information extraction (IE) and summarization are closely related, both tasked with presenting a subset of the information contained in a natural language text. However, while IE extracts structural representations, summarization aims to abstract the most salient information into a generated text summary – thus potentially encountering the technical limitations of current text generation methods (e.g., hallucination). To mitigate this risk, this work uses structured IE graphs to enhance the abstractive summarization task. Specifically, we focus on improving Multi-Document Summarization (MDS) performance by using cross-document IE output, incorporating two novel components: (1) the use of auxiliary entity and event recognition systems to focus the summary generation model; (2) incorporating an alignment loss between IE nodes and their text spans to reduce inconsistencies between the IE graphs and text representations. Operationally, both the IE nodes and corresponding text spans are projected into the same embedding space and pairwise distance is minimized. Experimental results on multiple MDS benchmarks show that summaries generated by our model are more factually consistent with the source documents than baseline models while maintaining the same level of abstractiveness.

Keywords:
Automatic summarization Computer science Pairwise comparison Text graph Focus (optics) Information retrieval Salient Natural language processing Source document Information extraction Multi-document summarization Graph Baseline (sea) Task (project management) Artificial intelligence Theoretical computer science

Metrics

11
Cited By
2.81
FWCI (Field Weighted Citation Impact)
35
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Multi-document Summarization for Terrorism Information Extraction

Fu Lee WangChristopher C. YangXiaodong Shi

Lecture notes in computer science Year: 2006 Pages: 602-608
JOURNAL ARTICLE

Graph based Multi-Document Summarization with Latent Topics

Risa KitajimaIchiro Kobayashi

Journal:   Journal of Japan Society for Fuzzy Theory and Intelligent Informatics Year: 2013 Vol: 25 (6)Pages: 914-923
JOURNAL ARTICLE

Unsupervised Graph-Based Tibetan Multi-Document Summarization

Xiaodong YanYiqin WangWei SongXiaobing ZhaoA. RunYanxing Yang

Journal:   Computers, materials & continua/Computers, materials & continua (Print) Year: 2022 Vol: 73 (1)Pages: 1769-1781
© 2026 ScienceGate Book Chapters — All rights reserved.