English Text Classification Using Improved Recursive Feature Elimination (IRFE) Algorithm

Ahmed H. Aliwy Esraa H. Al-Ameer Esraa H. Al-Ameer

doi:10.26389/ajsrp.r080420

ScienceGate Book Chapters

JOURNAL ARTICLE

English Text Classification Using Improved Recursive Feature Elimination (IRFE) Algorithm

Ahmed H. Aliwy Esraa H. Al-Ameer Esraa H. Al-Ameer

Year: 2020 Journal: مجلة العلوم الهندسية و تكنولوجيا المعلومات Vol: 4 (2)Pages: 120-110

DOI: 10.26389/ajsrp.r080420

Get Full-Text PDF Get Analytical Report

Abstract

Documents classification is from most important fields for Natural language processing and text mining. There are many algorithms can be used for this task. In this paper, focuses on improving Text Classification by feature selection. This means determine some of the original features without affecting the accuracy of the work, where our work is a new feature selection method was suggested which can be a general formulation and mathematical model of Recursive Feature Elimination (RFE). The used method was compared with other two well-known feature selection methods: Chi-square and threshold. The results proved that the new method is comparable with the other methods, The best results were 83% when 60% of features used, 82% when 40% of features used, and 82% when 20% of features used. The tests were done with the Naïve Bayes (NB) and decision tree (DT) classification algorithms , where the used dataset is a well-known English data set “20 newsgroups text” consists of approximately 18846 files. The results showed that our suggested feature selection method is comparable with standard Like Chi-square.

Keywords:

Feature selection Feature (linguistics) Computer science Decision tree Naive Bayes classifier Artificial intelligence Pattern recognition (psychology) Selection (genetic algorithm) Set (abstract data type) Task (project management) Data mining Algorithm Statistical classification Machine learning Support vector machine

Metrics

Cited By

0.15

FWCI (Field Weighted Citation Impact)

Refs

0.59

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

English Text Classification Using Improved Recursive Feature Elimination (IRFE) Algorithm

Abstract

Metrics

Citation History

Topics

Related Documents

Recursive feature elimination algorithm feature definitions.

Parkinson’s disease classification using nature inspired feature selection and recursive feature elimination

Random Forest Optimization Using Recursive Feature Elimination for Stunting Classification

Performance Enhancement of Raga Classification Systems Using Recursive Feature Elimination

Recursive Feature Elimination and Gravitational Search Algorithm for Classification of Medical Data