JOURNAL ARTICLE

Feature selection for Chinese Text Categorization based on improved particle swarm optimization

Abstract

Feature selection is an important preprocessing step of Chinese Text Categorization, which reduces the high dimension and keeps the reduced results comprehensible compared to feature extraction. A novel criterion to filter features coarsely is proposed, which integrating the superiorities of term frequency-inverse document frequency as inner-class measure and CHI-square as inter-class, and a new feature selection method for Chinese text categorization based on swarm intelligence is presented, which using improved particle swarm optimization to select features fine on the results of coarse grain filtering, and utilizing support vector machine to evaluate feature subsets and taking the evaluations as the fitness of particles. The experiments on Fudan University Chinese Text Classification Corpus show a higher classification accuracy obtained by using the new criterion for features filtering and an effective feature reduction ratio attained by utilizing the novel FS method for Chinese text categorization.

Keywords:
Feature selection Particle swarm optimization Computer science Artificial intelligence Pattern recognition (psychology) Preprocessor Feature (linguistics) Feature extraction Categorization Filter (signal processing) Selection (genetic algorithm) Text categorization Dimensionality reduction Dimension (graph theory) Machine learning Mathematics

Metrics

16
Cited By
0.80
FWCI (Field Weighted Citation Impact)
19
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Computational Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Algorithms and Applications
Physical Sciences →  Engineering →  Control and Systems Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.