结合近邻和密度思想的K-均值算法的研究

计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (19): 147-149.

• 数据库、信号与信息处理 • 上一篇下一篇

结合近邻和密度思想的K-均值算法的研究

王春风，唐拥政

江苏盐城工学院现代教育技术中心，江苏盐城 224051

收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-07-01 发布日期:2011-07-01

Research of K-means algorithm combined with neighbors and density

WANG Chunfeng，TANG Yongzheng

Modern Education Technology Center，Yancheng Institute of Technology，Yancheng，Jiangsu 224051，China

Received:1900-01-01 Revised:1900-01-01 Online:2011-07-01 Published:2011-07-01

摘要/Abstract

摘要： 为了解决K-均值算法对初始聚类中心的依赖性，提出了一种新的选取初始聚类中心的算法。采用数据区内的最高密度点作为初始中心，基于近邻点属于同一聚类的特性，找到距离初始中心最远的点，将其加入初始聚类中心后再进行计算并依次下去的方法。该改进算法的初始聚类中心分布比较合理，而且剔除了孤立点对初始聚类中心的影响，从而可以得到更好的划分效果。实验表明，用改进的算法进行聚类更能够得到较高且稳定的准确率。

关键词: 密度, 近邻, 聚类算法, K-均值, 聚类中心

Abstract: In order to solve the dependence of initial cluster center，a new K-means algorithm based on the initial cluster center has been proposed.The new algorithm selects a point having the highest density as the initial center，and based on the characteristics of neighboring points belong to the same cluster，finds the point of the furthest distance from the initial center.Next，the point is added into the initial cluster center and is calculated，then it is turned down approach.The initial cluster center distribution of the improved algorithm is more reasonable，the influence of isolated points is eliminated，and the effect of delineation is more better.The experiment shows that the improved clustering algorithm has higher and more stable accuracy.

Key words: density, neighbors, clustering algorithm, K-means, cluster center

王春风，唐拥政. 结合近邻和密度思想的K-均值算法的研究[J]. 计算机工程与应用, 2011, 47(19): 147-149.

WANG Chunfeng，TANG Yongzheng. Research of K-means algorithm combined with neighbors and density[J]. Computer Engineering and Applications, 2011, 47(19): 147-149.

[1]	兰红，黄敏. 融合KNN优化的密度峰值和FCM聚类算法[J]. 计算机工程与应用, 2021, 57(9): 81-88.
[2]	李莉，纪欣沅，宋嵩. 回环软件缺陷数量预测模型[J]. 计算机工程与应用, 2021, 57(7): 158-163.
[3]	雷恒林，古兰拜尔·吐尔洪，买日旦·吾守尔，张东梅. 新奇检测综述[J]. 计算机工程与应用, 2021, 57(5): 47-55.
[4]	彭启慧，宣士斌，高卿. 分布的自动阈值密度峰值聚类算法[J]. 计算机工程与应用, 2021, 57(5): 71-78.
[5]	王俊玲，卢新明. 基于语义相关的视频关键帧提取算法[J]. 计算机工程与应用, 2021, 57(4): 192-198.
[6]	王芙银，张德生，张晓. 结合鲸鱼优化算法的自适应密度峰值聚类算法[J]. 计算机工程与应用, 2021, 57(3): 94-102.
[7]	张忠林，赵昱，闫光辉. 自然邻居密度极值聚类算法[J]. 计算机工程与应用, 2021, 57(23): 200-210.
[8]	王乐，韩萌，李小娟，张妮，程浩东. 不平衡数据集分类方法综述[J]. 计算机工程与应用, 2021, 57(22): 42-52.
[9]	梅婕，魏圆圆，许桃胜. 基于密度峰值多起始中心的融合聚类算法[J]. 计算机工程与应用, 2021, 57(22): 78-85.
[10]	左健豪，姜文刚. 自适应融合特征的人群计数网络[J]. 计算机工程与应用, 2021, 57(21): 203-208.
[11]	张子然，黄卫华，陈阳，章政，李梓远. 基于双向搜索的改进蚁群路径规划算法[J]. 计算机工程与应用, 2021, 57(21): 270-277.
[12]	陈倩茹，李雅丽，许科全，刘铱龙，王淑琴. 自调优自适应遗传算法的WKNN特征选择方法[J]. 计算机工程与应用, 2021, 57(20): 164-171.
[13]	丁松阳，田青云. Ball-Tree优化的密度峰值聚类算法[J]. 计算机工程与应用, 2021, 57(20): 90-96.
[14]	卫丹妮，杨有龙，仇海全. 结合密度峰值和切边权值的自训练算法[J]. 计算机工程与应用, 2021, 57(2): 70-76.
[15]	孟东霞，李玉鑑. 利用自然最近邻的不平衡数据过采样方法[J]. 计算机工程与应用, 2021, 57(2): 91-96.

结合近邻和密度思想的K-均值算法的研究

Research of K-means algorithm combined with neighbors and density

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics