JOURNAL ARTICLE

A Hierarchical Representation Model Based on Longformer and Transformer for Extractive Summarization

Shihao YangShaoru ZhangMing FangFengqin YangShuhua Liu

Year: 2022 Journal:   Electronics Vol: 11 (11)Pages: 1706-1706   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Automatic text summarization is a method used to compress documents while preserving the main idea of the original text, including extractive summarization and abstractive summarization. Extractive text summarization extracts important sentences from the original document to serve as the summary. The document representation method is crucial for the quality of the generated summarization. To effectively represent the document, we propose a hierarchical document representation model Long-Trans-Extr for Extractive Summarization, which uses Longformer as the sentence encoder and Transformer as the document encoder. The advantage of Longformer as sentence encoder is that the model can input long document up to 4096 tokens with adding relative a little calculation. The proposed model Long-Trans-Extr is evaluated on three benchmark datasets: CNN (Cable News Network), DailyMail, and the combined CNN/DailyMail. It achieves 43.78 (Rouge-1) and 39.71 (Rouge-L) on CNN/DailyMail and 33.75 (Rouge-1), 13.11 (Rouge-2), and 30.44 (Rouge-L) on the CNN datasets. They are very competitive results, and furthermore, they show that our model has better performance on long documents, such as the CNN corpus.

Keywords:
Automatic summarization Computer science Transformer Encoder Sentence Artificial intelligence Natural language processing Representation (politics) Benchmark (surveying) ROUGE Information retrieval

Metrics

11
Cited By
2.15
FWCI (Field Weighted Citation Impact)
30
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.