Naive Bayes (NB for short) is one of the popular methods for supervised classification in a knowledge management system. Currently, in many real-world applications, high-dimensional data pose a major challenge to conventional NB classifiers, due to noisy or redundant features and local relevance of these features to classes. In this paper, an automated feature weighting solution is proposed to result in a NB method effective in dealing with high-dimensional data. We first propose a locally weighted probability model, for Bayesian modeling in high-dimensional spaces, to implement a soft feature selection scheme. Then we propose an optimization algorithm to find the weights in linear time complexity, based on the Logitnormal priori distribution and the Maximum a Posteriori principle. Experimental studies show the effectiveness and suitability of the proposed model for high-dimensional data classification.
Sang‐Bum KimHee-Cheol SeoHae‐Chang Rim
Qiaowei JiangWen WangHan XuShasha ZhangXinyan WangCong Wang
Hui ChenLifei ChenQingshan JiangShun Guo
Liangxiao JiangChaoqun LiShasha WangLungan Zhang
Jia WuShirui PanXingquan ZhuZhihua CaiPeng ZhangChengqi Zhang