In recent years, distributed learning has attracted much attention due to the explosion of big databases, which are in some cases distributed across different nodes. However, the great majority of current feature selection and classification algorithms are designed for centralized learning, i.e., they use the whole dataset at once. In this paper, a new approach for learning on vertically partitioned data is presented, covering both feature selection and classification. The approach splits the data by features and then applies the chi-square filter and the naive Bayes classifier at each node. Finally, a merging procedure updates the learned model in an incremental fashion. Experimental results on five representative datasets show that execution time is shortened considerably while classification performance is maintained as the number of nodes increases.
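The pipeline described above can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: it assumes discrete features, shows the chi-square statistic as a per-feature scoring function for the filter step, trains one naive Bayes model per vertical partition, and merges the nodes at prediction time by summing per-partition log-likelihoods with a single shared log-prior (possible because naive Bayes factorizes over features). All names (`chi2_score`, `NodeNB`, `merged_predict`) are hypothetical.

```python
import math
from collections import Counter

def chi2_score(feature, labels):
    # Chi-square statistic between one discrete feature and the class labels;
    # the filter step would rank features by this score at each node.
    n = len(labels)
    f_counts, c_counts = Counter(feature), Counter(labels)
    joint = Counter(zip(feature, labels))
    stat = 0.0
    for f, nf in f_counts.items():
        for c, nc in c_counts.items():
            expected = nf * nc / n
            observed = joint.get((f, c), 0)
            stat += (observed - expected) ** 2 / expected
    return stat

class NodeNB:
    """Naive Bayes trained on one vertical partition (a subset of features)."""
    def fit(self, cols, labels):
        self.classes = sorted(set(labels))
        self.prior = {c: labels.count(c) / len(labels) for c in self.classes}
        # Per-column conditional value counts, one table per class
        self.cond = []
        for col in cols:
            values = set(col)
            table = {c: Counter(v for v, y in zip(col, labels) if y == c)
                     for c in self.classes}
            self.cond.append((values, table))
        return self

    def log_likelihood(self, x, c):
        # log P(x_partition | c) with Laplace smoothing
        ll = 0.0
        for (values, table), v in zip(self.cond, x):
            count = table[c].get(v, 0)
            total = sum(table[c].values())
            ll += math.log((count + 1) / (total + len(values)))
        return ll

def merged_predict(nodes, parts):
    # Merge step: naive Bayes factorizes over features, so the global
    # posterior is one log-prior plus the sum of per-node log-likelihoods.
    classes = nodes[0].classes
    scores = {c: math.log(nodes[0].prior[c]) +
                 sum(node.log_likelihood(x, c) for node, x in zip(nodes, parts))
              for c in classes}
    return max(scores, key=scores.get)
```

Because the merge only sums log-likelihoods, adding a node never requires retraining the others, which is what makes the incremental update cheap.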