Automated text categorization is a supervised learning task, defined as assigning category labels to new documents based on likelihood suggested by a training set of labeled documents. Two examples of methodology for text categorizations are Naive Bayes and K-Nearest Neighbor. In this thesis, we implement two categorization engines based on Naive Bayes and K-Nearest Neighbor methodology. We then compare the effectiveness of these two engines by calculating standard precision and recall for a collection of documents. We will further report on time efficiency of these two engines.
Fengxi SongShuhai LiuJingyu Yang
SongFengxiLiuShuhaiYangJing-Yu
Ismail HmeidiMahmoud Al‐AyyoubNawaf AbdullaAbdalrahman A. AlmodawarRaddad AbooraigNizar A. Mahyoub
Yong YangX. JianDong HuaXiao Li
Zhihong DengShiwei TangD. YangMing Zhang Li-Yu LiKunqing Xie