Bidirectional Encoder Approach for Abstractive Text Summarization of Urdu Language

Muhammad Asif; Syed Ali Raza; Javed Iqbal; Nousheen Perwaiz; Tauqeer Faiz; Shan Khan

doi:10.1109/icbats54253.2022.9759026

ScienceGate Book Chapters

JOURNAL ARTICLE

Bidirectional Encoder Approach for Abstractive Text Summarization of Urdu Language

Muhammad Asif Syed Ali Raza Javed Iqbal Nousheen Perwaiz Tauqeer Faiz Shan Khan

Year: 2022 Journal: 2022 International Conference on Business Analytics for Technology and Security (ICBATS) Pages: 1-8

DOI: 10.1109/icbats54253.2022.9759026

Get Full-Text PDF Get Analytical Report

Abstract

The fast pace of accumulating textual data in the online sphere has made it laborious to get out handy information from a profuse amount of information. NLP's area: automate text summarization, yields a great quality and considerable gist; abstracts and summaries of written texts of myriad human languages. Several attempts have been carried out previously in extractive summarization systems; however, research in abstractive summarization in the Urdu language has not been studied well so far. Urdu is a very rich language in terms of literary sources and it requires serious research efforts to generate abstractive summaries. In this research, we employ a composition of abstractive and extractive algorithms in an automated text summarization system for the Urdu language. In extractive summaries, we use word frequency, Sentence weight, and TF-IDF algorithms. Further, a hybrid method is introduced to improve the results of extractive summaries. Bidirectional Encoder Representations from Transformers (BERT) model is used to process the summaries generated by hybrid method for generation of abstractive summary. To evaluate the system-generated summaries, the assistance of the experts of Urdu language is reaped.

Keywords:

Automatic summarization Urdu Computer science Natural language processing Encoder Artificial intelligence Linguistics

Metrics

Cited By

0.71

FWCI (Field Weighted Citation Impact)

Refs

0.66

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Bidirectional Encoder Approach for Abstractive Text Summarization of Urdu Language

Abstract

Metrics

Citation History

Topics

Related Documents

An Approach Toward Abstractive Text Summarization for Urdu Language Using LLM (ATSUL)

Abstractive Text Summarization for the Urdu Language: Data and Methods

RUATS: Abstractive Text Summarization for Roman Urdu

Keyword-Aware Encoder for Abstractive Text Summarization

IWM-LSTM encoder for abstractive text summarization