A K-Nearest Neighbor Algorithm based on cluster in text classification

Chunyan Wang; Kuo Zhang; Yu-Guang Yan; Jiangang Li

doi:10.1109/cmce.2010.5610477


JOURNAL ARTICLE


Year: 2010



Abstract

The K-Nearest Neighbor (K-NN) algorithm is an important approach to automatic text classification. In this paper, clustering is applied to overcome the disadvantages of the traditional K-NN algorithm. First, the training set is clustered with an improved K-means approach, and the most representative samples are selected as cluster centers. Then the similarity between each test sample and the central vector of each cluster is computed. On this basis, a cluster-based K-NN algorithm is presented. The experimental results indicate that this classification algorithm is much faster than the traditional K-NN algorithm and that it can also improve accuracy.
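The general idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the "improved K-means" step, so plain K-means with cosine similarity stands in for it, cluster means are used as the central vectors, and clustering is run separately for each class so that every center carries a label. The function names (`kmeans`, `build_representatives`, `classify`), the parameters, and the toy vectors are all hypothetical.

```python
import math
import random
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def kmeans(vectors, n_clusters, n_iter=20, seed=0):
    """Plain K-means (assumption: stands in for the paper's improved variant)."""
    rng = random.Random(seed)
    centers = rng.sample(vectors, min(n_clusters, len(vectors)))
    for _ in range(n_iter):
        groups = [[] for _ in centers]
        for v in vectors:
            best = max(range(len(centers)), key=lambda i: cosine(v, centers[i]))
            groups[best].append(v)
        # An emptied cluster keeps its previous center.
        centers = [mean_vector(g) if g else c for g, c in zip(groups, centers)]
    return centers

def build_representatives(train, clusters_per_class=2):
    """Replace each class's training vectors by a few labeled cluster centers."""
    by_label = {}
    for vec, label in train:
        by_label.setdefault(label, []).append(vec)
    reps = []
    for label, vecs in by_label.items():
        for center in kmeans(vecs, clusters_per_class):
            reps.append((center, label))
    return reps

def classify(reps, query, k=3):
    """K-NN vote over cluster centers instead of over all training samples."""
    ranked = sorted(reps, key=lambda r: cosine(query, r[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

# Toy term-weight vectors (hypothetical data, for illustration only).
train = [
    ([1.0, 0.0, 0.0], "sports"), ([0.9, 0.1, 0.0], "sports"),
    ([0.8, 0.0, 0.2], "sports"), ([0.0, 1.0, 0.0], "politics"),
    ([0.1, 0.9, 0.0], "politics"), ([0.0, 0.8, 0.2], "politics"),
]
reps = build_representatives(train, clusters_per_class=2)
print(classify(reps, [1.0, 0.1, 0.0], k=3))  # sports
```

Because a test sample is compared only with the cluster centers rather than with every training document, the number of similarity computations per query drops from the size of the training set to the number of clusters, which is the likely source of the speed-up the abstract reports.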

Keywords:

k-nearest neighbors algorithm; Computer science; Cluster analysis; Comparability; Cluster (spacecraft); Nearest-neighbor chain algorithm; Set (abstract data type); Pattern recognition (psychology); Statistical classification; Algorithm; Artificial intelligence; k-medoids; Data mining; Canopy clustering algorithm; Fuzzy clustering; Mathematics

Metrics

Cited By

FWCI (Field Weighted Citation Impact): 0.00

Refs

Citation Normalized Percentile: 0.09


Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Web Data Mining and Analysis

Physical Sciences → Computer Science → Information Systems

Machine Learning and Data Classification

Physical Sciences → Computer Science → Artificial Intelligence


Related Documents

A k-nearest neighbor text classification algorithm based on fuzzy integral

Novel Text Classification Based on K-Nearest Neighbor

The Decomposed K-Nearest Neighbor Algorithm for Imbalanced Text Classification

Improved K nearest neighbor classification algorithm

Fast k-nearest neighbor classification using cluster-based trees