Feature Selection on K-Nearest Neighbor Algorithm Using Similarity Measure

Ratih Puspadini; Herman Mawengkang; Syahril Efendi

doi:10.1109/mecnit48290.2020.9166612

ScienceGate Book Chapters

JOURNAL ARTICLE

Feature Selection on K-Nearest Neighbor Algorithm Using Similarity Measure

Ratih Puspadini Herman Mawengkang Syahril Efendi

Year: 2020 Pages: 226-231

DOI: 10.1109/mecnit48290.2020.9166612

Get Full-Text PDF Get Analytical Report

Abstract

Data mining was the data processing technique to be obtained knowledge or important pattern of data. One of the popular methods was the KNN (K-Nearest Neighbor) which was computational simplicity. There was some weakness of KNN, vulnerable in the data high dimensionality. It was caused of data high dimensionality, so that space can be occupied in instance to be greater. The method approach would be proposed in the research was the method to be omitted several numbers of features were irrelevant to the KNN method by using similarity measures (Euclidean distance, Correlation distance and Cosine similarity). The testing was done by testing the different three datasets and computed the average of accurate results. The results of testing have successes to be omitted the data features without decreasing accuracy, that the accuracy average of feature selection using the KNN algorithm: KNN without selection was 88.804%, KNN-Euclidean distance was 89.120%, KNN-Correlation distance is 89.567% and KNN-Cosine similarity was 89.134%.

Keywords:

k-nearest neighbors algorithm Euclidean distance Pattern recognition (psychology) Similarity (geometry) Artificial intelligence Curse of dimensionality Computer science Feature selection Cosine similarity Nearest neighbor search Correlation Distance measures Selection (genetic algorithm) Feature (linguistics) Dimensionality reduction Similarity measure Data mining Mathematics Image (mathematics)

Metrics

Cited By

1.13

FWCI (Field Weighted Citation Impact)

Refs

0.84

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Data Mining and Machine Learning Applications

Physical Sciences → Computer Science → Information Systems

Data Mining Algorithms and Applications

Physical Sciences → Computer Science → Information Systems

Information Retrieval and Data Mining

Physical Sciences → Computer Science → Information Systems

Feature Selection on K-Nearest Neighbor Algorithm Using Similarity Measure

Abstract

Metrics

Citation History

Topics

Related Documents

Malware Detection Using K-Nearest Neighbor Algorithm and Feature Selection

K nearest neighbor for text summarization using feature similarity

Classifying News Articles Using Feature Similarity K Nearest Neighbor

Semantic Word Categorization using Feature Similarity based K Nearest Neighbor

Text Classification Using K-Nearest Neighbor Algorithm and Firefly Algorithm for Text Feature Selection