Multi-Document Summarization of Persian Text Using Paragraph Vectors

Morteza Rohanian

doi:10.26615/issn.1314-9156.2017_005

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-Document Summarization of Persian Text Using Paragraph Vectors

Morteza Rohanian

Year: 2017 Journal: Student Research Workshop .../Proceedings of the Student Research Workshop ... Pages: 35-40

DOI: 10.26615/issn.1314-9156.2017_005

Get Full-Text PDF Get Analytical Report

Abstract

A multi-document summarizer finds the key topics from multiple textual sources and organizes information around them.In this paper we propose a summarization method for Persian text using paragraph vectors that can represent textual units of arbitrary lengths.We use these vectors to calculate the semantic relatedness between documents, cluster them to a number of predetermined groups, weight them based on their distance to the centroids and the intra-cluster homogeneity and take out the key paragraphs.We compare the final summaries with the goldstandard summaries of 21 digital topics using the ROUGE evaluation metric.Experimental results show the advantages of using paragraph vectors over earlier attempts at developing similar methods for a low resource language like Persian.

Keywords:

Automatic summarization Paragraph Computer science Persian Natural language processing Information retrieval Centroid Artificial intelligence Key (lock) Linguistics World Wide Web

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.15

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Multi-Document Summarization of Persian Text Using Paragraph Vectors

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-document Text Summarization Using Sentence Extraction

Hindi Multi-document Text Summarization using Text Rank Algorithm

Multi-document Text Summarization Tool

Multi-Document Text Summarization Using Deep Belief Network

Text Summarization in Multi Document Using Genetic Algorithm