Phrase Embedding Based Multi Document Summarization with Reduced Redundancy using Maximal Marginal Relevance

Sakkaravarthy Iyyappan K; S. R. Balasundaram

doi:10.1109/iceltics50595.2020.9315474

JOURNAL ARTICLE

Year: 2020 Pages: 1-5

Abstract

With the exponential growth of textual data on the Internet, Multi-Document Summarization (MDS) has become an essential NLP task: it aims to produce a concise representation of the main ideas of multiple related documents. Producing a non-redundant summary is challenging because different authors express the same content in different words. This paper proposes a new multi-document summarization system based on phrase embedding and a greedy Maximal Marginal Relevance (MMR) algorithm. The approach treats phrases as the basic meaningful semantic units of sentences for understanding and summarizing documents. Embedding techniques are used to learn vector representations of phrases so that semantically similar phrases can be identified. Finally, an MMR-based greedy algorithm selects sentences that contain important phrases while reducing redundancy among similar phrases. Experimental results on the DUC 2004 benchmark dataset show performance gains over state-of-the-art baselines.
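The greedy MMR selection step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how phrase importance or similarity is computed, so the sketch assumes (hypothetically) that each sentence vector is the mean of its phrase embeddings, that relevance is cosine similarity to the centroid of all sentences, and that a trade-off weight `lam` balances relevance against redundancy. The names `mmr_select`, `mean_vector`, and the toy vectors are illustrative only.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def mmr_select(sentence_vecs, k, lam=0.7):
    """Greedily pick up to k sentence indices by Maximal Marginal Relevance.

    Assumption: relevance is cosine similarity to the centroid of all
    sentence vectors (a stand-in for phrase importance). Redundancy is the
    highest cosine similarity to any sentence already selected.
    """
    centroid = mean_vector(sentence_vecs)
    relevance = [cosine(v, centroid) for v in sentence_vecs]
    selected = []
    candidates = list(range(len(sentence_vecs)))
    while candidates and len(selected) < k:
        def mmr_score(i):
            redundancy = max(
                (cosine(sentence_vecs[i], sentence_vecs[j]) for j in selected),
                default=0.0,
            )
            return lam * relevance[i] - (1.0 - lam) * redundancy

        best = max(candidates, key=mmr_score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy data: each sentence is a list of made-up 3-d phrase embeddings.
phrase_vecs_per_sentence = [
    [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]],  # sentence 0
    [[1.0, 0.0, 0.0], [0.8, 0.2, 0.0]],  # sentence 1: near-paraphrase of 0
    [[0.0, 1.0, 0.0], [0.0, 0.9, 0.1]],  # sentence 2: different content
]
sentence_vecs = [mean_vector(p) for p in phrase_vecs_per_sentence]
summary = mmr_select(sentence_vecs, k=2, lam=0.5)
```

With these toy vectors, the second pick is the sentence with different content rather than the near-paraphrase of the first pick, which is the redundancy reduction MMR is meant to provide. Setting `lam=1.0` removes the redundancy penalty and ranks by relevance alone.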

Keywords:

Computer science; Automatic summarization; Phrase; Redundancy (engineering); Natural language processing; Artificial intelligence; Embedding; Relevance (law); Information retrieval

Metrics

FWCI (Field Weighted Citation Impact): 0.15

Citation Normalized Percentile: 0.58

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning

Unsupervised Query-Focused Multi-document Summarization Using uSIF Sentence Embedding Model and Maximal Marginal Relevance Criterion

Khmer multi-document extractive summarization method based on hierarchical maximal marginal relevance

Abstractive Multi-Document Summarization: Exploiting Maximal Marginal Relevance and Pretrained Models

Combined Features to Maximal Marginal Relevance Algorithm for Multi-document Summarization