JOURNAL ARTICLE

Document Summarization Using Sentence-Level Semantic Based on Word Embeddings

Kamal Al‐SabahiZuping Zhang

Year: 2019 Journal:   International Journal of Software Engineering and Knowledge Engineering Vol: 29 (02)Pages: 177-196   Publisher: World Scientific

Abstract

In the era of information overload, text summarization has become a focus of attention in a number of diverse fields such as, question answering systems, intelligence analysis, news recommendation systems, search results in web search engines, and so on. A good document representation is the key point in any successful summarizer. Learning this representation becomes a very active research in natural language processing field (NLP). Traditional approaches mostly fail to deliver a good representation. Word embedding has proved an excellent performance in learning the representation. In this paper, a modified BM25 with Word Embeddings are used to build the sentence vectors from word vectors. The entire document is represented as a set of sentence vectors. Then, the similarity between every pair of sentence vectors is computed. After that, TextRank, a graph-based model, is used to rank the sentences. The summary is generated by picking the top-ranked sentences according to the compression rate. Two well-known datasets, DUC2002 and DUC2004, are used to evaluate the models. The experimental results show that the proposed models perform comprehensively better compared to the state-of-the-art methods.

Keywords:
Computer science Automatic summarization Natural language processing Word embedding Sentence Artificial intelligence Word (group theory) Rank (graph theory) Information retrieval Representation (politics) Text graph Word2vec Set (abstract data type) Embedding Mathematics

Metrics

9
Cited By
0.92
FWCI (Field Weighted Citation Impact)
14
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Latent Semantic Analysis Approach for Document Summarization Based on Word Embeddings

Kamal Al-SabahiZhang ZupingYang Kang

Journal:   KSII Transactions on Internet and Information Systems Year: 2019 Vol: 13 (1)
JOURNAL ARTICLE

Single-Document Summarization Using Sentence Embeddings and K-Means Clustering

Sanchit AgarwalNikhil Kumar SinghPriyanka Meel

Journal:   2018 International Conference on Advances in Computing, Communication Control and Networking (ICACCCN) Year: 2018 Pages: 162-165
JOURNAL ARTICLE

Multi-document Summarization Sentence Ordering Algorithm Using Semantic Analysis

Ji Hong MinLiao JunbiLEI Jingfa -Yuan Zhong-fan

Journal:   INTERNATIONAL JOURNAL ON Advances in Information Sciences and Service Sciences Year: 2012 Vol: 4 (14)Pages: 125-131
© 2026 ScienceGate Book Chapters — All rights reserved.