JOURNAL ARTICLE

A new feature selection algorithm in text categorization

Abstract

A major problem with text classification problems is the high dimensionality of the feature space. This paper investigates how genetic algorithm and k-means algorithm can help select relevant features in text classification. which uses the genetic algorithm (GA) optimization features to implement global searching, and uses k-means algorithm to selection operation to control the scope of the search, ensure the validity of each gene and the speed of convergence. Our experimental results show that the combination of GA and k-means algorithm is quite useful in reduce the high feature dimension, and improved accuracy and efficiency for text classification.

Keywords:
Computer science Feature selection Text categorization Curse of dimensionality Convergence (economics) Dimension (graph theory) Genetic algorithm Scope (computer science) Feature (linguistics) Artificial intelligence Selection (genetic algorithm) Statistical classification Population-based incremental learning Categorization Pattern recognition (psychology) Algorithm Algorithm design Data mining Machine learning Mathematics

Metrics

5
Cited By
0.80
FWCI (Field Weighted Citation Impact)
6
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

A novel feature selection algorithm for text categorization

Wenqian ShangHoukuan HuangHaibin ZhuYongmin LinYouli QuZhihai Wang

Journal:   Expert Systems with Applications Year: 2006 Vol: 33 (1)Pages: 1-5
JOURNAL ARTICLE

TEXT CATEGORIZATION BASED ON FEATURE SELECTION TECHNIQUE AND BGA ALGORITHM

Journal:   International Journal of Advance Engineering and Research Development Year: 2016 Vol: 3 (05)
JOURNAL ARTICLE

Best terms: an efficient feature-selection algorithm for text categorization

Dimitris FragoudisDimitris MeretakisSpiridon Likothanassis

Journal:   Knowledge and Information Systems Year: 2004 Vol: 8 (1)Pages: 16-33
© 2026 ScienceGate Book Chapters — All rights reserved.