JOURNAL ARTICLE

PAAD: POLITICAL ARABIC ARTICLES DATASET FOR AUTOMATIC TEXT CATEGORIZATION

Dhafar Hamed, Ahmed T. Sadiq, Ayad R. Abbas

Year: 2020; Journal: Iraqi Journal for Computers and Informatics; Vol: 46 (1); Pages: 1-10; Publisher: University of Information Technology and Communications

Abstract

Text classification and sentiment analysis are now among the most popular Natural Language Processing (NLP) tasks. These techniques play a significant role in human activities and influence daily behaviour. Articles in fields such as politics and business express different opinions according to the writer's tendency, and this variation produces a large amount of data. The capability to identify the political orientation of an online article automatically is therefore valuable. However, no corpus for political categorization has been directed towards this task in Arabic, owing to the lack of rich, representative resources for training an Arabic text classifier. We therefore introduce the Political Arabic Articles Dataset (PAAD), a collection of textual data gathered from newspapers, social networks, general forums and ideology websites. The dataset consists of 206 articles distributed across three categories (Reform, Conservative and Revolutionary), and we offer it to the research community working on Arabic computational linguistics. We anticipate that it will be a useful aid for a variety of NLP tasks on Modern Standard Arabic, in particular political text classification. We present the data in raw form and as an Excel file in four versions: V1 (raw data), V2 (preprocessed), V3 (root stemming) and V4 (light stemming).
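To illustrate what a light-stemming step of the kind named for V4 might involve, here is a minimal Python sketch. The abstract does not state the authors' actual rules, so the prefix and suffix lists, the function names (normalize, light_stem, preprocess) and the min_len threshold are all assumptions, loosely modelled on common Arabic light stemmers. It is not the procedure used to build PAAD.

```python
import re

# Assumed affix lists (hypothetical; not taken from the PAAD paper).
PREFIXES = ("وال", "بال", "كال", "فال", "ال", "لل")
SUFFIXES = ("ها", "ان", "ات", "ون", "ين", "ية", "ه", "ة", "ي")

# Arabic short-vowel marks (U+064B to U+0652) and tatweel (U+0640).
DIACRITICS = re.compile(r"[\u064B-\u0652\u0640]")


def normalize(word):
    """Remove diacritics and tatweel, and unify hamza/madda alef forms."""
    word = DIACRITICS.sub("", word)
    return re.sub("[أإآ]", "ا", word)


def light_stem(word, min_len=3):
    """Strip at most one prefix and one suffix, keeping at least min_len letters."""
    word = normalize(word)
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= min_len:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_len:
            word = word[:-len(suffix)]
            break
    return word


def preprocess(text):
    """Keep runs of Arabic letters and marks, then light-stem each token."""
    tokens = re.findall(r"[\u0621-\u0652]+", text)
    return [light_stem(token) for token in tokens]
```

Calling preprocess on an article body returns a list of stemmed tokens that could feed a bag-of-words classifier over the three categories. Longer prefixes are listed first so that a conjunction plus article is removed as one unit rather than leaving a stray letter.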

Keywords:
Computer science; Natural language processing; Variety (cybernetics); Artificial intelligence; Categorization; Classifier (UML); Newspaper; Sentiment analysis; Arabic; Preprocessor; Raw data; Politics; Data science; Linguistics; Political science

Metrics

Cited By: 8
FWCI (Field Weighted Citation Impact): 1.03
Refs: 43
Citation Normalized Percentile: 0.80

Topics

Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Misinformation and Its Impacts
Social Sciences →  Social Sciences →  Sociology and Political Science

Related Documents

JOURNAL ARTICLE

SANAD: Single-label Arabic News Articles Dataset for automatic text categorization

Omar Einea, Ashraf Elnagar, Ridhwan Al Debsi

Journal: Data in Brief; Year: 2019; Vol: 25; Pages: 104076-104076

JOURNAL ARTICLE

A Moroccan News Articles Dataset (MNAD) For Arabic Text Categorization

Mourad Jbene, Smail Tigani, Rachid Saadane, Abdellah Chehri

Journal: 2021 International Conference on Decision Aid Sciences and Application (DASA); Year: 2021; Pages: 350-353

JOURNAL ARTICLE

Automatic Learning of Arabic Text Categorization

Abdulrahman Al-Molegi, Izzat Alsmadi, Hassan Najadat, Haile Albashiri

Journal: International Journal of Digital Contents and Applications for Smart Devices; Year: 2015; Vol: 2 (1); Pages: 1-16