JOURNAL ARTICLE

Efficient parallel spectral clustering algorithm design for large data sets under cloud computing environment

Ran JinChunhai KouRuijuan LiuYefeng Li

Year: 2013 Journal:   Journal of Cloud Computing Advances Systems and Applications Vol: 2 (1)Pages: 18-18   Publisher: Springer Nature

Abstract

Spectral clustering algorithm has proved be more effective than most traditional algorithms in finding clusters. However, its high computational complexity limits its effect in actual application. This paper combines the spectral clustering with MapReduce, through evaluation of sparse matrix eigenvalue and computation of distributed cluster, puts forward the improvement ideas and concrete realization, and thus improves the clustering speed of the distinctive clustering algorithm. According to the experiment, with the processing data scale being enlarged, the clustering rate is in nearly linear growth, and the proposed parallel spectral clustering algorithm is suitable for large data mining. The research results provide research basis to better design a clustering partition algorithm in large data and high efficiency.

Keywords:
Cluster analysis Computer science CURE data clustering algorithm Correlation clustering Data stream clustering Canopy clustering algorithm Spectral clustering Algorithm Cloud computing Computation Partition (number theory) Fuzzy clustering Data mining Clustering high-dimensional data Artificial intelligence Mathematics

Metrics

24
Cited By
9.15
FWCI (Field Weighted Citation Impact)
17
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Computing and Algorithms
Social Sciences →  Social Sciences →  Urban Studies
Physical Activity and Education Research
Physical Sciences →  Environmental Science →  Water Science and Technology

Related Documents

BOOK-CHAPTER

Research on Fuzzy Clustering Algorithms for Large Dimensional Data Sets Under Cloud Computing

Shuang-cheng JiaFengping Yang

Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering Year: 2021 Pages: 295-305
JOURNAL ARTICLE

Parallel computing algorithms and efficient data mining model design for large-scale data sets

Wei LiDongliang Wang

Journal:   IET conference proceedings. Year: 2025 Vol: 2025 (25)Pages: 24-29
JOURNAL ARTICLE

Efficient parallel viterbi algorithm for big data in a spark cloud computing environment

Imad SassiOumaima RedaSamir AnterAhmed Zellou

Journal:   Procedia Computer Science Year: 2022 Vol: 215 Pages: 937-946
© 2026 ScienceGate Book Chapters — All rights reserved.