Chaloemphon SirikayonArit Thammano
Clustering by the k-means is the most widely used method because of its ease of use. But the disadvantage of the k-means algorithm is that it relies on a random initialization. Therefore, the results obtained from each clustering are not stable depending on the starting point, affecting the results obtained in other applications. This paper, therefore, presents a method for determining the initialization of the k-means algorithm using the Data Distribution Guide (DDG). And use it as an aid in determining the starting point without random. Make the results of clustering always equal. And from the experimental results, We found that the accuracy obtained from clustering using the initialization from this method was good. Compared to the commonly used initialization designation.
M. EMRE CELEBIHASSAN A. KINGRAVI
Farid AhmatshinLev Kazakovtsev
M. Emre CelebiHassan A. Kingravi
Xu JunlingBaowen XuWeifeng ZhangWei ZhangJun Hou
Jiewei LuJiaowei TangZhenmin TangJingyu Yang