JOURNAL ARTICLE

Unsupervised extractive multi-document text summarization using a genetic algorithm

Verónica Neri-MendozaYulia LedenevaRené Arnulfo García-Hernández

Year: 2020 Journal:   Journal of Intelligent & Fuzzy Systems Vol: 39 (2)Pages: 2397-2408   Publisher: IOS Press

Abstract

The task of Extractive Multi-Document Text Summarization (EMDTS) aims at building a short summary with essential information from a collection of documents. In this paper, we propose an EMDTS method using a Genetic Algorithm (GA). The fitness function considering two unsupervised text features: sentence position and coverage. We propose the binary coding representation, selection, crossover, and mutation operators. We test the proposed method on the DUC01 and DUC02 data set, four different tasks (summary lengths 200 and 400 words), for each of the collections of documents (in total, 876 documents) are tested. Besides, we analyze the most frequently used methodologies to summarization. Moreover, different heuristics such as topline, baseline, baseline-random, and lead baseline are calculated. In the results, the proposed method achieves to improve the state-of-art results.

Keywords:
Automatic summarization Computer science Heuristics Crossover Multi-document summarization Baseline (sea) Sentence Coding (social sciences) Selection (genetic algorithm) Set (abstract data type) Binary number Genetic algorithm Artificial intelligence Information retrieval Algorithm Machine learning Statistics Mathematics

Metrics

6
Cited By
0.59
FWCI (Field Weighted Citation Impact)
40
Refs
0.72
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Text Summarization in Multi Document Using Genetic Algorithm

Nirwana HendrastutyAzhari SN

Journal:   IJCCS (Indonesian Journal of Computing and Cybernetics Systems) Year: 2021 Vol: 15 (4)Pages: 327-327
JOURNAL ARTICLE

A novel extractive multi-document text summarization system using quantum-inspired genetic algorithm: MTSQIGA

Mohammad MojrianSeyed Abolghasem Mirroshandel

Journal:   Expert Systems with Applications Year: 2021 Vol: 171 Pages: 114555-114555
© 2026 ScienceGate Book Chapters — All rights reserved.