Multi-Objective Feature Selection With Missing Data in Classification

Yu Xue; Yihang Tang; Xin Xu; Jiayu Liang; Ferrante Neri

doi:10.1109/tetci.2021.3074147

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-Objective Feature Selection With Missing Data in Classification

Yu Xue Yihang Tang Xin Xu Jiayu Liang Ferrante Neri

Year: 2021 Journal: IEEE Transactions on Emerging Topics in Computational Intelligence Vol: 6 (2)Pages: 355-364 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tetci.2021.3074147

Get Full-Text PDF Get Analytical Report

Abstract

Feature selection (FS) is an important research topic in machine learning.\nUsually, FS is modelled as a+ bi-objective optimization problem whose\nobjectives are: 1) classification accuracy; 2) number of features. One of the\nmain issues in real-world applications is missing data. Databases with missing\ndata are likely to be unreliable. Thus, FS performed on a data set missing some\ndata is also unreliable. In order to directly control this issue plaguing the\nfield, we propose in this study a novel modelling of FS: we include reliability\nas the third objective of the problem. In order to address the modified\nproblem, we propose the application of the non-dominated sorting genetic\nalgorithm-III (NSGA-III). We selected six incomplete data sets from the\nUniversity of California Irvine (UCI) machine learning repository. We used the\nmean imputation method to deal with the missing data. In the experiments,\nk-nearest neighbors (K-NN) is used as the classifier to evaluate the feature\nsubsets. Experimental results show that the proposed three-objective model\ncoupled with NSGA-III efficiently addresses the FS problem for the six data\nsets included in this study.\n

Keywords:

Missing data Feature selection Computer science Imputation (statistics) Sorting Data mining Classifier (UML) Genetic algorithm Data set Artificial intelligence Machine learning Feature (linguistics) Pattern recognition (psychology) Algorithm

Metrics

142

Cited By

20.01

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Multi-Objective Optimization Algorithms

Physical Sciences → Computer Science → Computational Theory and Mathematics

Metaheuristic Optimization Algorithms Research

Physical Sciences → Computer Science → Artificial Intelligence

Evolutionary Algorithms and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Multi-Objective Feature Selection With Missing Data in Classification

Abstract

Metrics

Citation History

Topics

Related Documents

A Wrapper Feature Selection Approach to Classification with Missing Data

Multi-task optimisation for multi-objective feature selection in classification

Multi-objective Optimisation-Based Feature Selection for Multi-label Classification

Causal Feature Selection with Missing Data

A multi-objective evolutionary feature selection approach for the classification of multi-label data