JOURNAL ARTICLE

An Effective Online Stream Feature Selection Auxiliary Method for High-Dimensional Unbalanced Data

Abstract

In feature selection for high-dimensional data, online streaming feature selection methods have received extensive attention over the past few decades because they can select features online. Existing online stream feature selection methods perform well on many balanced datasets, but real datasets are usually high-dimensional and unbalanced. For example, in medical examination data, the proportion of sick people is much smaller than that of healthy people. When faced with unbalanced data, traditional stream feature selection algorithms suffer from problems such as selecting too few features and low classification accuracy. Performing online stream feature selection under high-dimensional, unbalanced conditions is therefore a challenge. In this paper, a general and easy-to-implement auxiliary algorithm is proposed that can supplement existing stream feature selection methods and mine feature subsets effectively. Experiments on seven high-dimensional, unbalanced datasets show that the auxiliary method improves traditional online stream feature selection methods and enables classifiers to achieve better classification performance.

Keywords:
Feature selection; Computer science; Selection (genetic algorithm); Data stream; Data mining; Feature (linguistics); Artificial intelligence; Feature extraction; Pattern recognition (psychology); Machine learning

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 16
Citation Normalized Percentile: 0.09

Topics

Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

High Dimensional Unbalanced Data Classification Vs SVM Feature Selection

Chinna Gopi Simhadri, B. Suvarna, T. Maruthi Padmaja

Journal:   Indian Journal of Science and Technology Year: 2016 Vol: 9 (30)
JOURNAL ARTICLE

Feature selection method for High Dimensional Data

Journal:   International Journal of Modern Trends in Engineering & Research Year: 2016 Vol: 3 (10) Pages: 190-203
JOURNAL ARTICLE

Online feature selection for high-dimensional class-imbalanced data

Peng Zhou, Xuegang Hu, Peipei Li, Xindong Wu

Journal:   Knowledge-Based Systems Year: 2017 Vol: 136 Pages: 187-199
JOURNAL ARTICLE

Online streaming feature selection for high-dimensional small-sample data

Kuangfeng Gong, Guohe Li, Lingyun Guo, Yaojin Lin

Journal:   International Journal of Machine Learning and Cybernetics Year: 2024 Vol: 16 (4) Pages: 2705-2719