JOURNAL ARTICLE

Feature selection for text classification using genetic algorithms

Abstract

In text classification, feature selection is essential to improve the classification effectiveness. This paper provides an empirical study of a feature selection method based on genetic algorithms for different text representation methods. This feature selection algorithm can accomplish two goals: in one hand is the search of a feature subset such that the performance of classifier is best; in other hands is find a feature subset with the smallest dimensionality which achieves higher accuracy in classification. To evaluate the performance of this approach, three from the best classifiers have been selected: Naive Bayes (NB), Nearest Neighbors (KNN) and Support Vector Machines (SVMs). Our objective is to determine whether the genetic algorithms based feature selection will improve the performances in text classification with smaller size using F-measure. Experimentations were carried out on two benchmark document collections 20Newsgroups, and Reuters-21578. And the results were very interesting.

Keywords:
Feature selection Computer science Artificial intelligence Naive Bayes classifier Support vector machine Pattern recognition (psychology) Classifier (UML) Curse of dimensionality k-nearest neighbors algorithm Feature (linguistics) Machine learning Statistical classification Benchmark (surveying) Dimensionality reduction Selection (genetic algorithm) Feature extraction

Metrics

56
Cited By
3.66
FWCI (Field Weighted Citation Impact)
28
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics

Related Documents

BOOK-CHAPTER

Feature Selection for Text Classification Using Genetic Algorithm

Salma BelkarkorImad HafidiMourad Nachaoui

Lecture notes in networks and systems Year: 2023 Pages: 69-80
JOURNAL ARTICLE

Feature Selection for ECG Beat Classification using Genetic Algorithms

Çağla SarvanNalan ÖzkurtKorhan Karabulut

Journal:   Journal of Intelligent Systems with Applications Year: 2018 Pages: 149-156
JOURNAL ARTICLE

Classification through Enhanced Adaptive Hybrid Feature Selection using Genetic Algorithms

Dr. Viharos, Zsolt János

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2024
© 2026 ScienceGate Book Chapters — All rights reserved.