JOURNAL ARTICLE

Feature Selection on K-Nearest Neighbor Algorithm Using Similarity Measure

Abstract

Data mining was the data processing technique to be obtained knowledge or important pattern of data. One of the popular methods was the KNN (K-Nearest Neighbor) which was computational simplicity. There was some weakness of KNN, vulnerable in the data high dimensionality. It was caused of data high dimensionality, so that space can be occupied in instance to be greater. The method approach would be proposed in the research was the method to be omitted several numbers of features were irrelevant to the KNN method by using similarity measures (Euclidean distance, Correlation distance and Cosine similarity). The testing was done by testing the different three datasets and computed the average of accurate results. The results of testing have successes to be omitted the data features without decreasing accuracy, that the accuracy average of feature selection using the KNN algorithm: KNN without selection was 88.804%, KNN-Euclidean distance was 89.120%, KNN-Correlation distance is 89.567% and KNN-Cosine similarity was 89.134%.

Keywords:
k-nearest neighbors algorithm Euclidean distance Pattern recognition (psychology) Similarity (geometry) Artificial intelligence Curse of dimensionality Computer science Feature selection Cosine similarity Nearest neighbor search Correlation Distance measures Selection (genetic algorithm) Feature (linguistics) Dimensionality reduction Similarity measure Data mining Mathematics Image (mathematics)

Metrics

9
Cited By
1.13
FWCI (Field Weighted Citation Impact)
24
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining and Machine Learning Applications
Physical Sciences →  Computer Science →  Information Systems
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Information Retrieval and Data Mining
Physical Sciences →  Computer Science →  Information Systems

Related Documents

BOOK-CHAPTER

Classifying News Articles Using Feature Similarity K Nearest Neighbor

Taeho Jo

Lecture notes in electrical engineering Year: 2018 Pages: 73-78
JOURNAL ARTICLE

Semantic Word Categorization using Feature Similarity based K Nearest Neighbor

Taeho Jo

Journal:   Journal of Multimedia Information System Year: 2018 Vol: 5 (2)Pages: 67-78
© 2026 ScienceGate Book Chapters — All rights reserved.