JOURNAL ARTICLE

Feature Selection by Using Heuristic Methods for Text Classification

İlhami SELCelalettin YeroğluDavut Hanbay

Year: 2019 Journal:   2019 International Artificial Intelligence and Data Processing Symposium (IDAP) Pages: 1-6

Abstract

Feature selection can be defined as the selection of the best subset to represent the data set in machine learning applications, in other words extraction of the unnecessary data that has no effect on the result. In classification problems efficiency and accuracy of the system can be increased when the dimension is reduced by feature selection. In this study, text classifying application is performed by using the data set of "20 News Group" released in Reuters News Agent. The pre-processed news data were converted to vectors by using Doc2Vec method and the data set was created and classified by Naive Bayes method. Subsequently, a subset of the data set was formed by using heuristic methods that were inspired by nature (Whale and Gray Wolf Optimization Algorithms) and Chi-square method for feature selection. Then the reclassification was applied and the results were compared. While the success of the system with 600 features before the feature selection is 0.9214, the performance ratio of the 100 featured models created later is figured higher (0.94095 - 0.93833 - 0.93619).

Keywords:
Computer science Feature selection Artificial intelligence Heuristic Selection (genetic algorithm) Feature (linguistics) Pattern recognition (psychology) Machine learning Data mining Natural language processing

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
10
Refs
0.13
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Text Classification using KNN with different Feature Selection Methods

Rajshree JodhaGaur Sanjay B.CK. R. Chowdhary

Journal:   International Journal of Research Publications Year: 2018 Vol: 09 (1)Pages: 8-8
JOURNAL ARTICLE

Redundant Feature Selection Methods in Text Classification

Su Fen Chen

Journal:   Advanced materials research Year: 2014 Vol: 1044-1045 Pages: 1258-1261
JOURNAL ARTICLE

Review of feature selection methods for text classification

Muhammad Kashif IqbalMalik Muneeb AbidMuhammad Noman KhalidAmir Manzoor

Journal:   International Journal of Advanced Computer Research Year: 2020 Vol: 10 (49)Pages: 138-152
© 2026 ScienceGate Book Chapters — All rights reserved.