S. Santha SubbulaxmiG. Arumugam
Imbalanced data classification is a critical and challenging problem in both data mining and machine learning. Imbalanced data classification problems present in many application areas like rare medical diagnosis, risk management, fault-detection, etc. The traditional classification algorithms yield poor results in imbalanced classification problems. In this paper, K-Means cluster based undersampling ensemble algorithm is proposed to solve the imbalanced data classification problem. The proposed method combines K-Means cluster based undersampling and boosting method. The experimental results show that the proposed algorithm outperforms the other sampling ensemble algorithms of previous studies.
Parinaz SobhaniHerna L. ViktorStan Matwin
Wing W. Y. NgShichao XuJianjun ZhangXing TianTongwen RongSam Kwong
Qingyan YinJiangshe ZhangChunxia ZhangNannan Ji