JOURNAL ARTICLE

EFFICIENT DESIGN OF FAST FEATURE SUBSET SELECTION ALGORITHM FOR MULTI-DIMENSIONAL DATA BASED ON CLUSTERING

Meghana SatishT. Bhavana BhatM V TrupthiKaushal VishuC N Chinnaswamy

Year: 2015 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

Abstract Feature selection for clustering of high dimensional data clustering is a difficult problem because of the broad number of redundant and irrelevant featured that the data can have that can run the clustering. A weighting scheme is proposed, wherein the weight for each feature is measured by its contribution to the given clustering task.Two different steps are proposed in the algorithm In the first step features are divided into clusters using graph-theoretic clustering methods. In the second step,feature subsets are formed by the features that are strongly related to target classes. Feature subset selection research is focuses on searching for relevant features. The proposed logic on minimizes redundant data set and improves the feature subset accuracy. Efficient minimum-spanning tree (MST) clustering method to ensure efficiency of proposed algorithm, is adopted in the algorithm. Extensive experiments are carried out to compare the proposed algorithm and several other feature selection algorithms, namely, Relief, FCBF, CFS, Consist, and FOCUS-SF .The results, demonstrate that the algorithm not only produces smaller subsets of features but also improves the performances of the four types of classifiers. Keywords- Feature subset selection, filter technique, feature clustering, graph-based clustering.

Keywords:
Cluster analysis Feature (linguistics) Feature selection Pattern recognition (psychology) Minimum redundancy feature selection Correlation clustering CURE data clustering algorithm Canopy clustering algorithm Weighting

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.28
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data

Qinbao SongJingjie NiGuangtao Wang

Journal:   IEEE Transactions on Knowledge and Data Engineering Year: 2011 Vol: 25 (1)Pages: 1-14
JOURNAL ARTICLE

Mining of High Dimensional Data using Efficient Feature Subset Selection Clustering Algorithm (WEKA)

Lakshmi SarikaTCh. SamsonuB. Tarakeswara RaoB. Satyanarayana Reddy

Journal:   International Journal of Computer Applications Year: 2014 Vol: 107 (6)Pages: 7-12
JOURNAL ARTICLE

Efficient Feature Subset Selection Algorithm for High Dimensional Data

Smita ChormungeSudarson Jena

Journal:   International Journal of Electrical and Computer Engineering (IJECE) Year: 2016 Vol: 6 (4)Pages: 1880-1880
© 2026 ScienceGate Book Chapters — All rights reserved.