Abstract

Automatic text summarization is a growing area of natural language processing research. Using extractive text summarization approach a concise summary of the input information sources is developed by selecting phrases and sentences on a given selection criterion that can be based on features e.g. syntactic, semantic, temporal, positional, etc. Text summarization for under resourced languages e.g. Urdu is even more challenging, due to the limited availability of basic computational resources to effectively extract textual features. Surmounting these challenges, this paper presents an extractive text summarization methodology for Urdu language documents based on sentence weight algorithm using segmentation, tokenization and stopwords as prominent features. ROUGE metric is used for system evaluation by comparing system generated and human generated summaries. System accuracy at Unigram, bigram and trigram level is 67 percent.

Keywords:
Automatic summarization Computer science Bigram Natural language processing Artificial intelligence Trigram Hindi Urdu Sentence Text segmentation Multi-document summarization Information retrieval Segmentation Linguistics

Metrics

10
Cited By
0.40
FWCI (Field Weighted Citation Impact)
16
Refs
0.71
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

A hybrid PSO model in Extractive Text Summarizer

Oi-Mean FoongAlan Oxley

Year: 2011 Vol: 191 Pages: 130-134
BOOK-CHAPTER

An Extractive Text Summarizer Based on Significant Words

Xiaoyue LiuJonathan J. WebsterChunyu Kit

Lecture notes in computer science Year: 2009 Pages: 168-178
JOURNAL ARTICLE

HINDI TEXT SUMMARIZER USING ABSTRACTIVE AND EXTRACTIVE TECHNIQUE

Neelam Phadnis Gurveen Kaur Bans

Journal:   International Journal of Apllied Mathematics Year: 2025 Vol: 38 (1s)Pages: 134-149
© 2026 ScienceGate Book Chapters — All rights reserved.