JOURNAL ARTICLE

Automated feature weighting in naive Bayes for high-dimensional data classification

Abstract

Naive Bayes (NB) is a popular method for supervised classification in knowledge management systems. In many real-world applications, high-dimensional data pose a major challenge to conventional NB classifiers, because features may be noisy or redundant and because the relevance of a feature can vary from class to class (local relevance). This paper proposes an automated feature weighting solution that makes NB effective on high-dimensional data. We first propose a locally weighted probability model for Bayesian modeling in high-dimensional spaces, which implements a soft feature selection scheme. We then propose an optimization algorithm that finds the weights in linear time, based on a logit-normal prior distribution and the maximum a posteriori (MAP) principle. Experimental studies show the effectiveness and suitability of the proposed model for high-dimensional data classification.
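To make the idea of soft feature selection concrete, the following is a minimal sketch and not the paper's model or algorithm. It assumes one common form of feature-weighted NB, in which each class-conditional log-likelihood is multiplied by a class-specific weight in [0, 1], so a weight near 0 effectively switches a feature off for that class. The Gaussian likelihoods, the function names, and the hand-set weights are all illustrative assumptions; the MAP weight optimization with a logit-normal prior described in the abstract is not reproduced here.

```python
import math


def gaussian_logpdf(x, mean, var):
    """Log density of a univariate normal distribution."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def fit_gaussian_nb(X, y, var_floor=1e-9):
    """Estimate a log prior and per-feature mean/variance for each class."""
    model = {}
    n = len(X)
    for c in sorted(set(y)):
        rows = [x for x, label in zip(X, y) if label == c]
        dims = len(rows[0])
        means = [sum(r[d] for r in rows) / len(rows) for d in range(dims)]
        variances = [
            sum((r[d] - means[d]) ** 2 for r in rows) / len(rows) + var_floor
            for d in range(dims)
        ]
        model[c] = (math.log(len(rows) / n), means, variances)
    return model


def predict(model, weights, x):
    """Pick the class maximizing log prior + sum_d w[c][d] * log p(x_d | c).

    weights[c][d] is an assumed, hand-set weight for feature d in class c;
    in the paper the weights are learned, which this sketch does not do.
    """
    def score(c):
        log_prior, means, variances = model[c]
        return log_prior + sum(
            w * gaussian_logpdf(v, m, s)
            for w, v, m, s in zip(weights[c], x, means, variances)
        )
    return max(model, key=score)


# Toy data: feature 0 separates the classes, feature 1 is noise.
X = [[0.0, 5.0], [0.2, -4.0], [0.1, 0.5],
     [1.0, 4.5], [1.2, -5.0], [0.9, 0.0]]
y = [0, 0, 0, 1, 1, 1]
model = fit_gaussian_nb(X, y)

uniform = {0: [1.0, 1.0], 1: [1.0, 1.0]}    # conventional NB
soft_select = {0: [1.0, 0.0], 1: [1.0, 0.0]}  # noisy feature switched off

query = [0.1, 100.0]  # feature 0 looks like class 0; feature 1 is an outlier
label_uniform = predict(model, uniform, query)
label_weighted = predict(model, soft_select, query)
```

With the noisy feature weighted to zero, the prediction for `query` depends only on feature 0 and comes out as class 0, whereas with uniform weights the outlier in feature 1 can dominate the score. Keeping weights per class rather than global is what allows a feature to matter for one class and be ignored for another.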

Keywords:
Weighting; Computer science; A priori and a posteriori; Maximum a posteriori estimation; Naive Bayes classifier; Artificial intelligence; Feature (linguistics); Feature selection; Pattern recognition (psychology); Machine learning; Data mining; Bayesian probability; Relevance (law); Bayes' theorem; Bayesian programming; Bayesian optimization; Mathematics; Bayesian hierarchical modeling; Support vector machine; Maximum likelihood; Statistics

Metrics

Cited By: 31
FWCI (Field Weighted Citation Impact): 2.65
Refs: 32
Citation Normalized Percentile: 0.91

Topics

Bayesian Modeling and Causal Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition