K-means Clustering Algorithm Combining Max-Min Distance and Weighted Density

doi:10.3778/j.issn.1002-8331.1907-0334

Abstract

Abstract:

Both the random selection of initial clustering center and the empirical determination of [K] value have a certain impact on [K]-means clustering results. A [K]-means clustering algorithm based on weighted density and max-min distance is proposed. The clustering center set is selected by using the weighted density method to reduce the impact of outliers on clustering results. Then the center point is selected by the max-min distance to avoid the clustering result falling into local optimum. Finally, the value of [K] is determined by the ratio of the distance within clusters to the distance between clusters. Experiments show that the improved algorithm not only improves the accuracy of clustering, reduces the average iteration times of the algorithm, but also enhances the stability of the algorithm.

Key words: K-means, initial center, outliers, density method, max-min distance

摘要：

随机选取初始聚类中心和根据经验设置[K]值对[K]-means聚类结果都有一定的影响，针对这一问题，提出了一种基于加权密度和最大最小距离的[K]-means聚类算法，称为[KWDM]算法。该算法利用加权密度法选取初始聚类中心点集，减少了离群点对聚类结果的影响，通过最大最小距离准则启发式地选择聚类中心，避免了聚类结果陷入局部最优，最后使用准则函数即簇内距离和簇间距离的比值来确定[K]值，防止了根据经验来设置[K]值。在人工数据集和UCI数据集上的实验结果表明，KWDM算法不仅提高了聚类的准确率，而且减少了算法的平均迭代次数，增强了算法的稳定性。

关键词: K-means, 初始中心, 离群点, 密度法, 最大最小距离

MA Keqin, YANG Yanjiao, QIN Hongwu, GENG Lin, WANG Pidong. K-means Clustering Algorithm Combining Max-Min Distance and Weighted Density[J]. Computer Engineering and Applications, 2020, 56(16): 50-54.

马克勤，杨延娇，秦红武，耿琳，王丕栋. 结合最大最小距离和加权密度的K-means聚类算法[J]. 计算机工程与应用, 2020, 56(16): 50-54.

[1]	WANG Changlong, ZHANG Yuandong, MIAO Hong, YANG Yuheng. Application of Double Channel Convolutional Neural Network in Pumpkin Diseases Identification [J]. Computer Engineering and Applications, 2021, 57(5): 183-189.
[2]	ZHANG Ziran, HUANG Weihua, CHEN Yang, ZHANG Zheng, LI Ziyuan. Improved Ant Colony Path Planning Algorithm Based on Bidirectional Search [J]. Computer Engineering and Applications, 2021, 57(21): 270-277.
[3]	CHENG Jingyi, DUAN Xianhua, ZHU Wei. Research on Metal Surface Defect Detection by Improved YOLOv3 [J]. Computer Engineering and Applications, 2021, 57(19): 252-258.
[4]	PAN Chengsheng, ZHANG Bin, LYU Yana, DU Xiuli, QIU Shaoming. K-Means Text Clustering Based on Improved Gray Wolf Optimization Algorithm [J]. Computer Engineering and Applications, 2021, 57(1): 188-193.
[5]	GAO Weijun, SHI Yang, YANG Jie, ZHANG Chunxia. An Improved Lightweight Head Detection Method [J]. Computer Engineering and Applications, 2021, 57(1): 207-212.
[6]	LU Junjie, HUANG Jinquan, LU Feng. Likelihood K-means Clustering for Gas Path Failure Diagnostics of Turbofan Engine [J]. Computer Engineering and Applications, 2020, 56(9): 136-141.
[7]	WANG Weihong, ZENG Yingjie. Collaborative Filtering Recommendation Algorithm Based on Clustering and User Preference [J]. Computer Engineering and Applications, 2020, 56(3): 68-73.
[8]	ZONG Xiaoping, TIAN Weiqian. Segmentation and Feature Extraction of Brain Tumor Based on Magnetic Resonance Image Using K-means [J]. Computer Engineering and Applications, 2020, 56(3): 187-193.
[9]	WANG Zilong, LI Jin, SONG Yafei. Improved K-means Algorithm Based on Distance and Weight [J]. Computer Engineering and Applications, 2020, 56(23): 87-94.
[10]	ZHANG Zhen, LI Haofang, LI Mengzhou. Research on YOLO Algorithm in Abnormal Security Images [J]. Computer Engineering and Applications, 2020, 56(21): 187-193.
[11]	SUN Zhiran, SU Hang, LIANG Yi. Improved K-Prototypes Clustering Algorithm [J]. Computer Engineering and Applications, 2020, 56(21): 54-59.
[12]	MA Jinghui, PAN Wei, WANG Ru. 3D Point Cloud Classification Based on K-means Clustering [J]. Computer Engineering and Applications, 2020, 56(17): 181-186.
[13]	GUO Yongkun, ZHANG Xinyou, LIU Liping, DING Liang, NIU Xiaolu. K-means Clustering Algorithm of Optimizing Initial Clustering Center [J]. Computer Engineering and Applications, 2020, 56(15): 172-178.
[14]	LI Feng, LI Mingxiang, ZHANG Yujing. Partial Iterative Fast K-means Clustering Algorithm [J]. Computer Engineering and Applications, 2020, 56(13): 63-71.
[15]	WANG Jianren, MA Xin, DUAN Ganglong. Improved K-means Clustering k-Value Selection Algorithm [J]. Computer Engineering and Applications, 2019, 55(8): 27-33.

K-means Clustering Algorithm Combining Max-Min Distance and Weighted Density

结合最大最小距离和加权密度的K-means聚类算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics