JOURNAL ARTICLE

Feature selection using hierarchical feature clustering

Abstract

One of the challenges in data mining is the dimensionality of data, which is often very high and prevalent in many domains, such as text categorization and bio-informatics. The high-dimensionality of data may bring many adverse situations to traditional learning algorithms. To cope with this issue, feature selection has been put forward. Currently, many efforts have been attempted in this field and lots of feature selection algorithms have been developed. In this paper we propose a new selection method to pick discriminative features by using information measurement. The main characteristic of our selection method is that the selection procedure works like feature clustering in a hierarchically agglomerative way, where each feature is considered as a cluster and the between-cluster and within-cluster distances are measured by mutual information and the coefficient of relevancy respectively. Consequently, the final aggregated cluster is the selection result, which has the minimal redundancy among its members and the maximal relevancy with the class labels. The simulation experiments on seven datasets show that the proposed method outperforms other popular feature selection algorithms in classification performance.

Keywords:
Feature selection Computer science Cluster analysis Artificial intelligence Redundancy (engineering) Data mining Discriminative model Dimensionality reduction Curse of dimensionality Hierarchical clustering Pattern recognition (psychology) Minimum redundancy feature selection Feature (linguistics) Selection (genetic algorithm) Machine learning Mutual information Categorization

Metrics

59
Cited By
1.79
FWCI (Field Weighted Citation Impact)
31
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

A Feature Selection Method Using Hierarchical Clustering

Cheong Hee Park

Lecture notes in computer science Year: 2013 Pages: 1-6
JOURNAL ARTICLE

Feature selection for hierarchical clustering

Frederik QuestierBeata WalczakD.L. MassartC. BouconS. De Jong

Journal:   Analytica Chimica Acta Year: 2002 Vol: 466 (2)Pages: 311-324
BOOK-CHAPTER

Dynamic Feature Selection in Incremental Hierarchical Clustering

Luis Talavera

Lecture notes in computer science Year: 2000 Pages: 392-403
JOURNAL ARTICLE

Hierarchical clustering: Visualization, feature importance and model selection

Luben M. C. CabezasRafael IzbickiRafael B. Stern

Journal:   Applied Soft Computing Year: 2023 Vol: 141 Pages: 110303-110303
JOURNAL ARTICLE

Hierarchical feature selection with multi-granularity clustering structure

Shunxin GuoHong ZhaoWenyuan Yang

Journal:   Information Sciences Year: 2021 Vol: 568 Pages: 448-462
© 2026 ScienceGate Book Chapters — All rights reserved.