Unsupervised extractive multi-document text summarization using a genetic algorithm

Verónica Neri-Mendoza; Yulia Ledeneva; René Arnulfo García-Hernández

doi:10.3233/jifs-179900

ScienceGate Book Chapters

JOURNAL ARTICLE

Unsupervised extractive multi-document text summarization using a genetic algorithm

Verónica Neri-Mendoza Yulia Ledeneva René Arnulfo García-Hernández

Year: 2020 Journal: Journal of Intelligent & Fuzzy Systems Vol: 39 (2)Pages: 2397-2408 Publisher: IOS Press

DOI: 10.3233/jifs-179900

Get Full-Text PDF Get Analytical Report

Abstract

The task of Extractive Multi-Document Text Summarization (EMDTS) aims at building a short summary with essential information from a collection of documents. In this paper, we propose an EMDTS method using a Genetic Algorithm (GA). The fitness function considering two unsupervised text features: sentence position and coverage. We propose the binary coding representation, selection, crossover, and mutation operators. We test the proposed method on the DUC01 and DUC02 data set, four different tasks (summary lengths 200 and 400 words), for each of the collections of documents (in total, 876 documents) are tested. Besides, we analyze the most frequently used methodologies to summarization. Moreover, different heuristics such as topline, baseline, baseline-random, and lead baseline are calculated. In the results, the proposed method achieves to improve the state-of-art results.

Keywords:

Automatic summarization Computer science Heuristics Crossover Multi-document summarization Baseline (sea) Sentence Coding (social sciences) Selection (genetic algorithm) Set (abstract data type) Binary number Genetic algorithm Artificial intelligence Information retrieval Algorithm Machine learning Statistics Mathematics

Metrics

Cited By

0.59

FWCI (Field Weighted Citation Impact)

Refs

0.72

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Unsupervised extractive multi-document text summarization using a genetic algorithm

Abstract

Metrics

Citation History

Topics

Related Documents

Text Summarization in Multi Document Using Genetic Algorithm

Single Document Extractive Text Summarization Using Neural Networks and Genetic Algorithm

A novel extractive multi-document text summarization system using quantum-inspired genetic algorithm: MTSQIGA

Single document extractive text summarization using Genetic Algorithms

Extractive text Summarization using Genetic Clustering Algorithm