JOURNAL ARTICLE

Feature Selection based Arabic Text Classification using Different Machine Learning Algorithms

Abstract

Feature selection is a method of data pre-processing widely used when mining large data, such as textual classification. Several studies have been conducted to compare the different methods of feature selection applied to corpora in English. Unfortunately, a small number of works concern the Arabic language. This article aims to present a comparative study of different feature selection techniques including: Chi2, the ANOVA method and mutual information, applied on a corpus in Arabic language, while also diversifying the machine learning algorithms (Naive Bayes, SVM and KNN). This experimental study has shown in general that reducing dimensionality with feature selection techniques has slightly affected the performance of textual classification, reducing the size of the corpus by up to 1%.

Keywords:
Feature selection Computer science Artificial intelligence Naive Bayes classifier Support vector machine Feature (linguistics) Selection (genetic algorithm) Dimensionality reduction Arabic Natural language processing Curse of dimensionality Machine learning Mutual information Statistical classification Feature extraction k-nearest neighbors algorithm Pattern recognition (psychology) Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
11
Refs
0.21
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Different Classification Algorithms Based on Arabic Text Classification: Feature Selection Comparative Study

Ghazi. I RahoRiyad Al–ShalabiGhassan KanaanAsma'a Nassar

Journal:   International Journal of Advanced Computer Science and Applications Year: 2015 Vol: 6 (2)
JOURNAL ARTICLE

Arabic text classification using machine learning and deep learning algorithms

Rawad Awad AlqahtaniHoda Ahmed Abdelhafez

Journal:   IAES International Journal of Artificial Intelligence Year: 2025 Vol: 14 (6)Pages: 5201-5201
JOURNAL ARTICLE

Feature Selection for Text Classification Using Machine Learning Approaches

K. ThirumoorthyK. Muneeswaran

Journal:   National Academy Science Letters Year: 2021 Vol: 45 (1)Pages: 51-56
JOURNAL ARTICLE

Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Djelloul BouchihaAbdelghani BouzianeNoureddine Doumi

Journal:   Journal of Information Technology and Computing Year: 2023 Vol: 4 (1)Pages: 1-14
© 2026 ScienceGate Book Chapters — All rights reserved.