Feature Selection based Arabic Text Classification using Different Machine Learning Algorithms

Sakina Rim Bennabi; Zakaria Elberrichi

doi:10.1145/3447568.3448531

ScienceGate Book Chapters

JOURNAL ARTICLE

Feature Selection based Arabic Text Classification using Different Machine Learning Algorithms

Sakina Rim Bennabi Zakaria Elberrichi

Year: 2020 Pages: 1-5

DOI: 10.1145/3447568.3448531

Get Full-Text PDF Get Analytical Report

Abstract

Feature selection is a method of data pre-processing widely used when mining large data, such as textual classification. Several studies have been conducted to compare the different methods of feature selection applied to corpora in English. Unfortunately, a small number of works concern the Arabic language. This article aims to present a comparative study of different feature selection techniques including: Chi2, the ANOVA method and mutual information, applied on a corpus in Arabic language, while also diversifying the machine learning algorithms (Naive Bayes, SVM and KNN). This experimental study has shown in general that reducing dimensionality with feature selection techniques has slightly affected the performance of textual classification, reducing the size of the corpus by up to 1%.

Keywords:

Feature selection Computer science Artificial intelligence Naive Bayes classifier Support vector machine Feature (linguistics) Selection (genetic algorithm) Dimensionality reduction Arabic Natural language processing Curse of dimensionality Machine learning Mutual information Statistical classification Feature extraction k-nearest neighbors algorithm Pattern recognition (psychology) Linguistics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.21

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Feature Selection based Arabic Text Classification using Different Machine Learning Algorithms

Abstract

Metrics

Topics

Related Documents

Different Classification Algorithms Based on Arabic Text Classification: Feature Selection Comparative Study

Arabic text classification using machine learning and deep learning algorithms

Feature Selection for Text Classification Using Machine Learning Approaches

Arabic Language Text Classification Using Dependency Syntax-Based Feature Selection

Ontology based Feature Selection and Weighting for Text classification using Machine Learning