Density Peaks Clustering Optimized by [K] Nearest Neighbor’s Similarity

doi:10.3778/j.issn.1002-8331.1710-0059

Computer Engineering and Applications, 2019, Vol. 55, Issue 2: 148-153.


ZHU Qingfeng 1,2, GE Hongwei 1,2

1. Ministry of Education Key Laboratory of Advanced Process Control for Light Industry (Jiangnan University), Wuxi, Jiangsu 214122, China
2. School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China

Online: 2019-01-15    Published: 2019-01-15


Abstract

When assigning points to clusters, density peaks clustering considers only the distance between a sample point and its pointing point (the nearest point whose density is higher than its own), which makes it unsuitable for manifold clustering problems (such as the Circleblock and Lineblobs data sets). To address this, a density peaks clustering algorithm optimized by [K] nearest neighbor similarity is proposed. After the density and pointing point of every point have been calculated, the [K] nearest neighbors of each point are found with a similarity function, and the [K] nearest neighbor information is then used to judge whether each sample point's pointing point is correct. For a point whose pointing point is wrong, the correct pointing point is searched for again, which effectively reduces wrong assignments. Experiments on artificial data sets and UCI data sets show that the new algorithm achieves higher accuracy.

Key words: clustering, density peaks, similarity, [K] nearest neighbor

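The quantities named in the abstract (density, pointing point, [K] nearest neighbors) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the similarity function or the correction rule, so the sketch assumes a cutoff-kernel density with a hypothetical radius `d_c`, uses Euclidean distance as a stand-in for the similarity, and only flags a pointing point as suspect when it lies outside the sample's K nearest neighbors. The function names and that flagging criterion are assumptions made for illustration.

```python
import numpy as np

def density_and_pointing(X, d_c):
    """Return (rho, pointing, dist) for samples X of shape (n, d).

    rho      : cutoff-kernel density (assumed kernel, radius d_c)
    pointing : index of the nearest point of higher density, -1 for the densest
    dist     : pairwise Euclidean distance matrix
    """
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    # Count the other points closer than d_c (subtract 1 to exclude the point itself).
    rho = (dist < d_c).sum(axis=1) - 1
    pointing = np.full(len(X), -1)
    # Visit points from densest to sparsest; density ties are broken by index,
    # so an earlier point in this order is treated as "higher density".
    order = np.argsort(-rho, kind="stable")
    for rank in range(1, len(order)):
        i = order[rank]
        higher = order[:rank]
        pointing[i] = higher[np.argmin(dist[i, higher])]
    return rho, pointing, dist

def knn_indices(dist, k):
    """Indices of the k nearest neighbors of each point, excluding the point itself."""
    d = dist.copy()
    np.fill_diagonal(d, np.inf)
    return np.argsort(d, axis=1, kind="stable")[:, :k]

def suspect_pointing(pointing, knn):
    """Hypothetical check: flag a point whose pointing point is not among its k neighbors."""
    return np.array([p != -1 and p not in knn[i] for i, p in enumerate(pointing)])
```

In this sketch a flagged point is only a candidate for reassignment; how the corrected pointing point is chosen is left open because the abstract does not describe it.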

ZHU Qingfeng, GE Hongwei. Density Peaks Clustering Optimized by [K] Nearest Neighbor’s Similarity[J]. Computer Engineering and Applications, 2019, 55(2): 148-153.
