JOURNAL ARTICLE

Ensemble learning based predictive modelling on a highly imbalanced multiclass data

Manka VastiAmita Dev

Year: 2024 Journal:   Journal of Information and Optimization Sciences Vol: 45 (8)Pages: 2141-2164   Publisher: Taylor & Francis

Abstract

Class imbalance in the real-world datasets is a big challenge and the domains such as fraud detection, calamity occurrences, bankruptcy prediction etc. are prone to class imbalance due to the nature of occurrences of the events. In this paper, the detailed research using six ensemble machine learning techniques is applied to the undersampled, oversampled and the original dataset and the results are compared. The results of the research study indicates that amongst the applied six ensemble learners, the best learner is Random Forest algorithm (with entropy gain) implemented using ten-fold cross validation on the SMOTE oversampled dataset. 0.95 AUC and 0.8689 accuracy i.e. an increase of 4% in accuracy and substantial increase in other performance indicators is observed as compared to the remaining five ensemble classifiers.

Keywords:
Ensemble learning Computer science Machine learning Artificial intelligence Ensemble forecasting

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.35
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management
© 2026 ScienceGate Book Chapters — All rights reserved.