Computer Engineering and Applications ›› 2020, Vol. 56 ›› Issue (13): 63-71.DOI: 10.3778/j.issn.1002-8331.1906-0070

Previous Articles     Next Articles

Partial Iterative Fast K-means Clustering Algorithm

LI Feng, LI Mingxiang, ZHANG Yujing   

  1. 1.Information Management and Engineering Department, Hebei Finance University, Baoding, Hebei 071051, China
    2.Applied Technology Research and Development Center Wisdom Finance in Hebei University, Baoding, Hebei 071051, China
  • Online:2020-07-01 Published:2020-07-02



  1. 1.河北金融学院 信息管理与工程系,河北 保定 071051
    2.河北省高校智慧金融应用技术研发中心,河北 保定 071051


The K-means algorithm is one of the most popular and widely spread clustering methods. But it is not always possibly to find the appropriate initial value of the cluster centers, especially when the number of clusters is increased. If it can’t find suitable initial values, that will affect the clustering effect. This paper proposes an iterative approach to improve the quality of the clustering. This method called Partial Iterative Fast K-means plus-minus(PIFKM+?). Based on the K-means clustering, the algorithm divides a cluster and removes another one, then re-clusters the affected data, in each iteration. The algorithm reduces the time complexity and improves the effect of clustering. The proposed method has the advantages of being able to update clusters quickly, is insensitive to initial values of cluster centers, and can improve clustering accuracy in the face of a large number of clusters. By comparing with the K-means and K-means++, experimental results vividly demonstrate that the algorithm has better clustering effect, higher operating efficiency and scalability on the simulation data sets and the real data sets. Through the statistical analysis of the final experimental results, it is shown that the PIFKM+? algorithm does not lose too much time efficiency while improving clustering accuracy.

Key words: K-means algorithm, cluster segmentation, cluster removing, partial iterative clustering, cluster neighbor



关键词: K-means算法, 聚类分割, 聚类删除, 局部迭代聚类, 聚类邻居