JOURNAL ARTICLE

Learning on Vertically Partitioned Data based on Chi-square Feature Selection and Naive Bayes Classification

Abstract

In the last few years, distributed learning has been the focus of much attention due to the explosion of big databases, in some cases distributed across different nodes. However, the great majority of current selection and classification algorithms are designed for centralized learning, i.e. they use the whole dataset at once. In this paper, a new approach for learning on vertically partitioned data is presented, which covers both feature selection and classification. The approach splits the data by features, and then uses the chi-square filter and the naive Bayes classifier to learn at each node. Finally, a merging procedure is performed, which updates the learned model in an incremental fashion. The experimental results on five representative datasets show that the execution time is shortened considerably whereas the classification performance is maintained as the number of nodes increases.

Keywords:
Computer science Naive Bayes classifier Feature selection Machine learning Artificial intelligence Classifier (UML) Focus (optics) Data mining Selection (genetic algorithm) Statistical classification Filter (signal processing) Bayes classifier Pattern recognition (psychology) Support vector machine

Metrics

2
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.27
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Machine Learning and ELM
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.