Using K Nearest Neighbors for text segmentation with feature similarity

Taeho Jo

doi:10.1109/iccccee.2017.7866706

ScienceGate Book Chapters

JOURNAL ARTICLE

Using K Nearest Neighbors for text segmentation with feature similarity

Taeho Jo

Year: 2017 Vol: 11 Pages: 1-5

DOI: 10.1109/iccccee.2017.7866706

Get Full-Text PDF Get Analytical Report

Abstract

In this research, we propose the version of K Nearest Neighbor which considers similarity among attributes for computing the similarity between feature vectors. The text segmentation task is viewed into the binary classification where each pair of sentences or paragraphs is classified into whether we put the boundary or not, and the proposed version resulted in the successful results in previous works concerned with the text categorization and clustering. In this research, we define the similarity measure based on both attributes and values, modify the KNN using it, and apply the modified version into the text segmentation task. We may expect more compact representation of data items and improved performance in the text segmentation task as well as other tasks of text mining. Therefore, the goal of this research is to implement the text segmentation system which provides the benefits.

Keywords:

k-nearest neighbors algorithm Computer science Artificial intelligence Similarity (geometry) Pattern recognition (psychology) Feature (linguistics) Segmentation Nearest neighbor search Image segmentation Image (mathematics)

Metrics

Cited By

1.38

FWCI (Field Weighted Citation Impact)

Refs

0.84

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Using K Nearest Neighbors for text segmentation with feature similarity

Abstract

Metrics

Citation History

Topics

Related Documents

K nearest neighbor for text summarization using feature similarity

Articulatory feature classification using nearest neighbors

Feature reduction for imbalanced data classification using similarity-based feature clustering with adaptive weighted K-nearest neighbors

Unsupervised Word Segmentation using K Nearest Neighbors

Weighted nearest neighbors feature selection