Verónica Neri-MendozaYulia LedenevaRené Arnulfo García-Hernández
The task of Extractive Multi-Document Text Summarization (EMDTS) aims at building a short summary with essential information from a collection of documents. In this paper, we propose an EMDTS method using a Genetic Algorithm (GA). The fitness function considering two unsupervised text features: sentence position and coverage. We propose the binary coding representation, selection, crossover, and mutation operators. We test the proposed method on the DUC01 and DUC02 data set, four different tasks (summary lengths 200 and 400 words), for each of the collections of documents (in total, 876 documents) are tested. Besides, we analyze the most frequently used methodologies to summarization. Moreover, different heuristics such as topline, baseline, baseline-random, and lead baseline are calculated. In the results, the proposed method achieves to improve the state-of-art results.
Niladri ChatterjeeGautam JainGurkirat Singh Bajwa
Mohammad MojrianSeyed Abolghasem Mirroshandel
Niladri ChatterjeeAmol MittalShubham Goyal
Alok RaiYashashree PatilPooja SulakheG. Sugin Lal