JOURNAL ARTICLE

Unsupervised Representative Feature Selection Algorithm Based on Information Entropy and Relevance Analysis

Yintong Wang

Year: 2018 Journal:   IEEE Access Vol: 6 Pages: 45317-45324   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Feature selection plays an important role in preprocessing in pattern recognition and data mining, especially in large scale image, digital text, and biological data. Specifically, class label information is unavailable for conducting the selection of a minimal feature subset in unsupervised learning, which is full of challenging and interesting problems. In this paper, we present an unsupervised REPresentative Feature Selection (REPFS) algorithm based on information entropy and relevance analysis. The proposed method seeks to find a high-quality feature subset through feature clustering without using any learning algorithms. More importantly, the features' relevance will be computed based on an information metric of the relevance gain, which provides an information theoretical foundation for finding a minimum of the redundancy between features. Our results on nine benchmark data sets demonstrate that REPFS can significantly improve upon state-of-the-art unsupervised algorithms.

Keywords:
Feature selection Computer science Artificial intelligence Pattern recognition (psychology) Cluster analysis Minimum redundancy feature selection Entropy (arrow of time) Unsupervised learning Preprocessor Information gain Relevance (law) Feature (linguistics) Data mining Machine learning Redundancy (engineering) Feature learning Data pre-processing

Metrics

9
Cited By
0.43
FWCI (Field Weighted Citation Impact)
37
Refs
0.62
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.