JOURNAL ARTICLE

Probability density function estimation based over-sampling for imbalanced two-class problems

Abstract

A novel probability density function (PDF) estimation based over-sampling approach is proposed for two-class imbalanced classification problems. The Parzen-window kernel function is applied to estimate the PDF of the positive class, from which synthetic instances are generated as additional training data to re-balance the class distribution. Utilising the re-balanced over-sampled training data, a radial basis function (RBF) classifier is constructed by applying an orthogonal forward regression, in which the classifier's structure and the parameters of RBF kernels are determined using a particle swarm optimisation algorithm based on the criterion of minimising the leave-one-out misclassification rate. The effectiveness of the proposed approach is demonstrated by an empirical study on several imbalanced data sets.

Keywords:
Kernel density estimation Probability density function Pattern recognition (psychology) Artificial intelligence Radial basis function Classifier (UML) Computer science Density estimation Kernel (algebra) Mathematics Statistics Artificial neural network

Metrics

26
Cited By
3.79
FWCI (Field Weighted Citation Impact)
53
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.