计算机工程与应用 ›› 2013, Vol. 49 ›› Issue (19): 153-157.

• 图形图像处理 • 上一篇    下一篇

一种新的k-medoids聚类算法

姚丽娟,罗  可,孟  颖   

  1. 长沙理工大学 计算机与通信工程学院,长沙 410114
  • 出版日期:2013-10-01 发布日期:2015-04-20

New k-medoids clustering algorithm

YAO Lijuan, LUO Ke, MENG Ying   

  1. Institute of Computer and Communication Engineering, Changsha University of Sciences and Technology, Changsha 410114, China
  • Online:2013-10-01 Published:2015-04-20

摘要: 针对k-medoids算法对初始聚类中心敏感,聚类精度较低及收敛速度缓慢的缺点,提出一种基于密度初始化、密度迭代的搜索策略和准则函数优化的方法。该算法初始化是在高密度区域内选择k个相对距离较远的样本作为聚类初始中心,有效定位聚类的最终中心点;在k个与初始中心点密度相近的区域内进行中心点替换,以减少候选点的搜索范围;采用类间距和类内距加权的均衡化准则函数,提高聚类精度。实验结果表明,相对于传统的k-mediods算法及某些改进算法,该算法可以提高聚类质量,有效缩短聚类时间。

关键词: 聚类, k-medoids算法, 密度初始化, 目标函数

Abstract: For the disadvantages that sensitivity to centers initialization, lower clustering accuracy and slow convergent speed of k-medoids algorithm, a novel k-medoids algorithm based on density initialization, density of iterative search strategy and optimization criterion function is proposed. The Initialization of the algorithm is that, it chooses k cluster centers in the high-density area which are far apart, effectively positioning of the final cluster center. To replace the centers are in the ranges which are proximity to the k-initial centers, to reduce the scope of the search candidate point. Criterion function of equalization based on class density and within-class density weighted is adopted to improve the clustering precision. Experimental results show that this algorithm can improve the clustering quality, shorten the clustering time compared with traditional k-medoids algorithms or other improved algorithms.

Key words: clustering, k-medoids algorithm, density initialization, criterion function