JOURNAL ARTICLE

A comparative study on text categorization

Abstract

Automated text categorization is a supervised learning task: assigning category labels to new documents based on the likelihood suggested by a training set of labeled documents. Two examples of methods for text categorization are Naive Bayes and K-Nearest Neighbor. In this thesis, we implement two categorization engines, one based on each method. We then compare the effectiveness of the two engines by calculating standard precision and recall on a collection of documents. We also report on the time efficiency of the two engines.
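
As a rough illustration of the kind of comparison the abstract describes, the sketch below trains a small multinomial Naive Bayes classifier and a cosine-similarity K-Nearest Neighbor classifier on the same labeled documents and scores both with precision and recall. It is not the engines built in the thesis: the toy documents, the "spam"/"ham" labels, the add-one smoothing, the whitespace tokenizer, and all function names are assumptions made for illustration only.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; a real engine would do more.
    return text.lower().split()


def train_naive_bayes(docs):
    """docs is a list of (text, label) pairs. Returns the counts the model needs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in docs:
        label_counts[label] += 1
        for word in tokenize(text):
            word_counts[label][word] += 1
            vocab.add(word)
    return label_counts, word_counts, vocab


def predict_naive_bayes(model, text):
    """Pick the label with the highest log posterior (add-one smoothing)."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        score = math.log(label_counts[label] / total_docs)
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in tokenize(text):
            if word in vocab:  # words never seen in training are skipped
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def predict_knn(docs, text, k=3):
    """Majority vote among the k training documents most similar to text."""
    query = Counter(tokenize(text))
    ranked = sorted(
        docs, key=lambda d: cosine(query, Counter(tokenize(d[0]))), reverse=True
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def precision_recall(gold, predicted, category):
    """Precision and recall for one category, given parallel label lists."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == category and p == category)
    fp = sum(1 for g, p in pairs if g != category and p == category)
    fn = sum(1 for g, p in pairs if g == category and p != category)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical toy corpus, invented for this sketch.
train = [
    ("cheap pills buy now", "spam"),
    ("win money now", "spam"),
    ("limited offer buy cheap", "spam"),
    ("meeting agenda for monday", "ham"),
    ("project report attached", "ham"),
    ("lunch on monday", "ham"),
]
test = [
    ("buy cheap pills", "spam"),
    ("monday project meeting", "ham"),
    ("win a cheap offer", "spam"),
    ("report for lunch", "ham"),
]

gold = [label for _, label in test]
nb_model = train_naive_bayes(train)
nb_pred = [predict_naive_bayes(nb_model, text) for text, _ in test]
knn_pred = [predict_knn(train, text, k=3) for text, _ in test]

for name, pred in (("Naive Bayes", nb_pred), ("K-Nearest Neighbor", knn_pred)):
    p, r = precision_recall(gold, pred, "spam")
    print(f"{name}: precision={p:.2f} recall={r:.2f}")
```

Time efficiency could be compared in the same spirit by wrapping each prediction loop in `time.perf_counter()` calls. Note that this K-Nearest Neighbor sketch rescans every training document per query, which is the usual reason it is slower at prediction time than Naive Bayes.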

Keywords:
Categorization; Natural language processing; Computer science; Artificial intelligence; Linguistics; Philosophy

Metrics

Cited By: 6
FWCI (Field Weighted Citation Impact): 0.29
Refs: 2
Citation Normalized Percentile: 0.61

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

A comparative study on text representation schemes in text categorization

Fengxi Song, Shuhai Liu, Jingyu Yang

Journal: Pattern Analysis and Applications Year: 2005 Vol: 8 (1-2) Pages: 199-209
JOURNAL ARTICLE

Comparative Study on Feature Selection in Uighur Text Categorization

Yong Yang, X. Jian, Dong Hua, Xiao Li

Journal: International Journal on Advances in Information Sciences and Service Sciences Year: 2012 Vol: 4 (3) Pages: 19-26