Rajshree JodhaGaur Sanjay B.CK. R. Chowdhary
This paper presents a fast and efficient approach for text classification using KNN for different feature selection method. Typically, this approach evaluates the performance of the system for minimum number of features required to classify the text documents. 20 Newsgroup dataset collected by Ken Lang, have been taken to check performance of the KNN classifier algorithm. The above dataset is separated into two parts viz. training set (60%) and test set (40%). The KNN classifier has been implemented against the different number of stemmed and unstemmed features for CHI (Chi-Squared Statistic), IG (Information Gain) and MI (Mutual Information). The Accuracy, Precision, Recall and F1-Score are used to test the system.
Rajshree JodhaGaur Sanjay BcK. R. ChowdharyAmit Kumar Mishra
Anirban DasguptaPetros DrineasBoulos HarbVanja JosifovskiMichael W. Mahoney
İlhami SELCelalettin YeroğluDavut Hanbay
Abdulmohsen Al-ThubaityNorah AbanumaySara Al-JerayyedAljoharah AlrukbanZarah M. Mannaa