JOURNAL ARTICLE

A Correlation-Based Feature Weighting Filter for Naive Bayes

Liangxiao JiangLungan ZhangChaoqun LiJia Wu

Year: 2018 Journal:   IEEE Transactions on Knowledge and Data Engineering Vol: 31 (2)Pages: 201-213   Publisher: IEEE Computer Society

Abstract

Due to its simplicity, efficiency, and efficacy, naive Bayes (NB) has continued to be one of the top 10 algorithms in the data mining and machine learning community. Of numerous approaches to alleviating its conditional independence assumption, feature weighting has placed more emphasis on highly predictive features than those that are less predictive. In this paper, we argue that for NB highly predictive features should be highly correlated with the class (maximum mutual relevance), yet uncorrelated with other features (minimum mutual redundancy). Based on this premise, we propose a correlation-based feature weighting (CFW) filter for NB. In CFW, the weight for a feature is a sigmoid transformation of the difference between the feature-class correlation (mutual relevance) and the average feature-feature intercorrelation (average mutual redundancy). Experimental results show that NB with CFW significantly outperforms NB and all the other existing state-of-the-art feature weighting filters used to compare. Compared to feature weighting wrappers for improving NB, the main advantages of CFW are its low computational complexity (no search involved) and the fact that it maintains the simplicity of the final model. Besides, we apply CFW to text classification and have achieved remarkable improvements.

Keywords:
Weighting Feature (linguistics) Artificial intelligence Computer science Redundancy (engineering) Pattern recognition (psychology) Mutual information Correlation Naive Bayes classifier Machine learning Data mining Filter (signal processing) Relevance (law) Algorithm Mathematics Support vector machine

Metrics

263
Cited By
18.47
FWCI (Field Weighted Citation Impact)
68
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Bayesian Modeling and Causal Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

A correlation-based feature weighting filter for multi-label Naive Bayes

Gurudatta VermaTirath Prasad Sahu

Journal:   International Journal of Information Technology Year: 2023 Vol: 16 (1)Pages: 611-619
JOURNAL ARTICLE

A Dependent Feature Weighting Filter for Naive Bayes Classifier

Gieliz ChatipFerkan Yılmaz

Journal:   2022 30th Signal Processing and Communications Applications Conference (SIU) Year: 2022 Vol: 7 Pages: 1-4
JOURNAL ARTICLE

Naive bayes-correlation based feature weighting technique for sports match result prediction

Manoj SharmaM. LambaNaresh KumarPardeep Kumar

Journal:   Evolutionary Intelligence Year: 2021 Vol: 15 (3)Pages: 2171-2186
JOURNAL ARTICLE

AN INFORMATION-THEORETIC FILTER METHOD FOR FEATURE WEIGHTING IN NAIVE BAYES

Chang‐Hwan Lee

Journal:   International Journal of Pattern Recognition and Artificial Intelligence Year: 2014 Vol: 28 (05)Pages: 1451007-1451007
© 2026 ScienceGate Book Chapters — All rights reserved.