JOURNAL ARTICLE

A Wasserstein Distance-Based Cost-Sensitive Framework for Imbalanced Data Classification

Ruyi FengHua JiZhenfeng ZhuL. Wang

Year: 2023 Journal:   Radioengineering Vol: 32 (3)Pages: 451-466   Publisher: Spolecnost pro radioelektronicke inzenyrstvi

Abstract

Class imbalance is a prevalent problem in many real-world applications, and imbalanced data distribution can dramatically skew the performance of classifiers. In general, the higher the imbalance ratio of a dataset, the more difficult it is to classify. However, it is found that standard classifiers can still achieve good classification results on some highly imbalanced datasets. Obviously, the class imbalance is only a superficial characteristic of the data, and the underlying structural information is often the key factor affecting the classification performance. As implicit prior knowledge, structural information has been validated to be crucial for designing a good classifier. This paper proposes a Wasserstein-based cost-sensitive support vector machine (CS-WSVM) for class imbalance learning, incorporating prior structural information and a cost-sensitive strategy. The Wasserstein distance is introduced to model the distribution of majority and minority samples to capture the structural information, which is employed to weight the majority and minority samples. Comprehensive experiments on synthetic and real-world datasets, especially on the radar emitter signal dataset, demonstrated that CS-WSVM can achieve outstanding performance in imbalanced scenarios.

Keywords:
Computer science Data mining Pattern recognition (psychology) Mathematics Artificial intelligence

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
45
Refs
0.10
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

A Measure Optimized Cost-Sensitive Learning Framework for Imbalanced Data Classification

Peng CaoOsmar R. Zaı̈aneDazhe Zhao

Advances in data mining and database management book series Year: 2014 Pages: 48-75
JOURNAL ARTICLE

Cost-sensitive boosting for classification of imbalanced data

Yanmin SunMohamed S. KamelAndrew K. C. WongYang Wang

Journal:   Pattern Recognition Year: 2007 Vol: 40 (12)Pages: 3358-3378
BOOK-CHAPTER

A PSO-Based Cost-Sensitive Neural Network for Imbalanced Data Classification

Peng CaoDazhe ZhaoOsmar R. Zaı̈ane

Lecture notes in computer science Year: 2013 Pages: 452-463
JOURNAL ARTICLE

Imbalanced Data Classification Based on Cost-Sensitive Large Margin Distribution Machine

Yaxi CaoHaiyan Huang

Journal:   华东理工大学学报(自然科学版) Year: 2019 Vol: 45 (4)Pages: 606-613
© 2026 ScienceGate Book Chapters — All rights reserved.