JOURNAL ARTICLE

Supervising the Centroid Baseline for Extractive Multi-Document Summarization

Abstract

The centroid method is a simple approach for extractive multi-document summarization and many improvements to its pipeline have been proposed. We further refine it by adding a beam search process to the sentence selection and also a centroid estimation attention model that leads to improved results. We demonstrate this in several multi-document summarization datasets, including in a multilingual scenario.

Keywords:
Automatic summarization Centroid Computer science Multi-document summarization Pipeline (software) Baseline (sea) Selection (genetic algorithm) Sentence Data mining Process (computing) Artificial intelligence Information retrieval

Metrics

1
Cited By
0.26
FWCI (Field Weighted Citation Impact)
34
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.