JOURNAL ARTICLE

Improved Measures of Redundancy and Relevance for mRMR Feature Selection

Insik JoSang-Bum LeeSejong Oh

Year: 2019 Journal:   Computers Vol: 8 (2)Pages: 42-42   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Many biological or medical data have numerous features. Feature selection is one of the data preprocessing steps that can remove the noise from data as well as save the computing time when the dataset has several hundred thousand or more features. Another goal of feature selection is improving the classification accuracy in machine learning tasks. Minimum Redundancy Maximum Relevance (mRMR) is a well-known feature selection algorithm that selects features by calculating redundancy between features and relevance between features and class vector. mRMR adopts mutual information theory to measure redundancy and relevance. In this research, we propose a method to improve the performance of mRMR feature selection. We apply Pearson’s correlation coefficient as a measure of redundancy and R-value as a measure of relevance. To compare original mRMR and the proposed method, features were selected using both of two methods from various datasets, and then we performed a classification test. The classification accuracy was used as a measure of performance comparison. In many cases, the proposed method showed higher accuracy than original mRMR.

Keywords:
Minimum redundancy feature selection Feature selection Redundancy (engineering) Computer science Artificial intelligence Pattern recognition (psychology) Preprocessor Relevance (law) Data mining Support vector machine Mutual information Measure (data warehouse) Data pre-processing Feature (linguistics) Machine learning

Metrics

88
Cited By
2.27
FWCI (Field Weighted Citation Impact)
23
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Machine Learning in Bioinformatics
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Bioinformatics and Genomic Networks
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
© 2026 ScienceGate Book Chapters — All rights reserved.