JOURNAL ARTICLE

Imbalanced classification using genetically optimized cost sensitive classifiers

Abstract

Classification is one of the most researched problems in machine learning, since the 1960s a myriad of different techniques have been proposed. The purpose of a classification algorithm, also known as a 'classifier', is to identify what class, or category an observation belongs to. In many real-world scenarios, datasets tend to suffer from class imbalance, where the number of observations belonging to one class greatly outnumbers that of the observations belonging to other classes. Class imbalance has been shown to hinder the performance of classifiers, and several techniques have been developed to improve the performance of imbalanced classifiers. Using a cost matrix is one such technique for dealing with class imbalance, however it requires a matrix to be either pre-defined, or manually optimized. This paper proposes an approach for automatically generating optimized cost matrices using a genetic algorithm. The genetic algorithm can generate matrices for classification problems with any number of classes, and is easy to tailor towards specific use-cases. The proposed approach is compared against unoptimized classifiers and alternative cost matrix optimization techniques using a variety of datasets. In addition to this, storage system failure prediction datasets are provided by Seagate UK, the potential of these datasets is investigated.

Keywords:
Computer science Artificial intelligence Pattern recognition (psychology) Machine learning Random subspace method Support vector machine

Metrics

26
Cited By
3.46
FWCI (Field Weighted Citation Impact)
30
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
Electricity Theft Detection Techniques
Physical Sciences →  Engineering →  Electrical and Electronic Engineering

Related Documents

BOOK-CHAPTER

A Measure Optimized Cost-Sensitive Learning Framework for Imbalanced Data Classification

Peng CaoOsmar R. Zaı̈aneDazhe Zhao

Advances in data mining and database management book series Year: 2014 Pages: 48-75
BOOK-CHAPTER

Training Cost-Sensitive Classifiers to Tackle Imbalanced Data

Utkarsh KejriwalKomal Arora

Advances in Science, Technology & Innovation/Advances in science, technology & innovation Year: 2025 Pages: 67-71
© 2026 ScienceGate Book Chapters — All rights reserved.