A Comparative study on text categorization

Aditya Chainulu Karamcheti

doi:10.34917/1563704

JOURNAL ARTICLE

Year: 2020

Abstract

Automated text categorization is a supervised learning task: assigning category labels to new documents based on the likelihood suggested by a training set of labeled documents. Naive Bayes and K-Nearest Neighbor are two methods for text categorization. In this thesis, we implement two categorization engines, one based on Naive Bayes and one based on K-Nearest Neighbor. We then compare the effectiveness of the two engines by calculating standard precision and recall on a collection of documents. We also report on the time efficiency of the two engines.
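To make the comparison described in the abstract concrete, here is a minimal Python sketch of the general idea. It is not the thesis's implementation: the class names, the whitespace tokenizer, the add-one smoothing, the cosine-similarity voting with k = 3, and the toy spam/ham documents are all assumptions chosen for illustration.

```python
import math
import time
from collections import Counter, defaultdict

def tokenize(text):
    # Assumption: lowercase whitespace tokenization, no stemming or stop words.
    return text.lower().split()

class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""
    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(tokenize(doc))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        self.total = len(labels)
        return self

    def predict(self, doc):
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / self.total)  # log prior
            for w in tokenize(doc):
                if w in self.vocab:  # words never seen in training are skipped
                    score += math.log((counts[w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

def cosine(a, b):
    # Cosine similarity between two term-frequency Counters.
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class KNearestNeighbor:
    """Majority vote among the k most similar training documents."""
    def __init__(self, k=3):
        self.k = k

    def fit(self, docs, labels):
        self.train = [(Counter(tokenize(d)), l) for d, l in zip(docs, labels)]
        return self

    def predict(self, doc):
        query = Counter(tokenize(doc))
        ranked = sorted(self.train, key=lambda t: cosine(query, t[0]), reverse=True)
        votes = Counter(label for _, label in ranked[:self.k])
        return votes.most_common(1)[0][0]

def precision_recall(true, pred, positive):
    # Precision and recall for one category treated as the positive class.
    tp = sum(t == positive and p == positive for t, p in zip(true, pred))
    fp = sum(t != positive and p == positive for t, p in zip(true, pred))
    fn = sum(t == positive and p != positive for t, p in zip(true, pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

def evaluate(engine, test_docs, test_labels, positive):
    # Returns (precision, recall, seconds spent classifying the test documents).
    start = time.perf_counter()
    pred = [engine.predict(d) for d in test_docs]
    elapsed = time.perf_counter() - start
    return (*precision_recall(test_labels, pred, positive), elapsed)

# Hypothetical toy collection, for illustration only.
train_docs = ["cheap pills buy now", "win money now", "meeting agenda attached",
              "project schedule meeting", "buy cheap watches", "lunch meeting tomorrow"]
train_labels = ["spam", "spam", "ham", "ham", "spam", "ham"]
test_docs = ["buy cheap pills", "meeting schedule tomorrow"]
test_labels = ["spam", "ham"]

nb = NaiveBayes().fit(train_docs, train_labels)
knn = KNearestNeighbor(k=3).fit(train_docs, train_labels)
for name, engine in [("Naive Bayes", nb), ("K-Nearest Neighbor", knn)]:
    p, r, secs = evaluate(engine, test_docs, test_labels, positive="spam")
    print(f"{name}: precision={p:.2f} recall={r:.2f} time={secs:.6f}s")
```

On a collection this small the timing figures are not meaningful; the sketch only shows where each measurement would be taken. Note that K-Nearest Neighbor defers all of its work to classification time, whereas Naive Bayes does most of its counting during training, which is one reason a time-efficiency comparison between the two is informative.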

Keywords: Categorization, Natural language processing, Computer science, Artificial intelligence, Linguistics, Philosophy

Metrics

FWCI (Field Weighted Citation Impact): 0.29

Citation Normalized Percentile: 0.61

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

A comparative study on text representation schemes in text categorization

Automatic Arabic text categorization: A comprehensive comparative study

Comparative Study on Feature Selection in Uighur Text Categorization

A Comparative Study on Feature Weight in Text Categorization