JOURNAL ARTICLE

Feature Selection Method of Text Tendency Classification

Abstract

Recently, automatic text categorization has made rapid progress and been one of the hotspots in the information processing field. Text tendency classification is one type of text categorization, which has very important applications in information retrievals bad information identification and filtering , content security management and analysis of public opinion tendency. To aim at the important influence of feature selection on text classification accuracy, this paper mainly studied feature selection method of tendency classification. First, to analyze and summarize the current variety methods, it points out three common ideas of feature selection. Then based on the analysis of complexity of tendency classification, it is proved that feature selection method based on the features' distribution in text categories is more suitable for tendency classification than the method based on the correlativity of features and categories. Finally, it gives test results for balanced training sets and unbalanced training sets.

Keywords:
Feature selection Categorization Computer science Selection (genetic algorithm) Text categorization Variety (cybernetics) Artificial intelligence Feature (linguistics) Field (mathematics) Identification (biology) Data mining Pattern recognition (psychology) Statistical classification Information retrieval Machine learning Mathematics

Metrics

8
Cited By
0.40
FWCI (Field Weighted Citation Impact)
8
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.