New clustering algorithm based on representatives and point density

doi:10.3778/j.issn.1002-8331.2008.28.046

Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (28): 136-139.DOI: 10.3778/j.issn.1002-8331.2008.28.046

• 数据库、信号与信息处理 • Previous Articles Next Articles

New clustering algorithm based on representatives and point density

CHEN Yuan-yuan¹,CHEN Zhi-ping^1,2

1.College of Computer and Communication，Hunan University，Changsha 410082，China
2.Department of Computer Science and Technology，Tsinghua University，Beijing 100084，China

Received:2007-11-20 Revised:2008-02-18 Online:2008-10-01 Published:2008-10-01
Contact: CHEN Yuan-yuan

一种基于代表点和点密度的聚类算法

陈园园¹,陈治平^1,2

1.湖南大学计算机与通信学院，长沙 410082
2.清华大学计算机科学与技术系，北京 100084

通讯作者: 陈园园

Abstract

Abstract: Aimed to solve the problem that the density-based clustering algorithm dose not work well when data distribution is not even，a new clustering algorithm based on representatives and point density is provided.The algorithm discovers the clusters by examining k neighbors of each point in the data base.It chooses a seed point as the first representative and the representative’s k neighbors as its represent area.If the point in the represent areas satisfies the density threshold，this point will be a new representative.And repeating searching like this，all the linked represent areas and representatives will be a cluster.Experimental results show that this algorithm can discover clusters with arbitrary shapes and densities at different levels.

Key words: data mining, clustering, point density, representative, density threshold

摘要： 针对基于密度的聚类方法不能发现密度分布不均的数据样本的缺陷，提出了一种基于代表点和点密度的聚类算法。算法通过检查数据库中每个点的k近邻来寻找聚类。首先选取一个种子点作为类的第一个代表点，其k近邻为其代表区域，如果代表区域中的点密度满足密度阈值，则将该点作为一个新的代表点，如此反复地寻找代表点，这些区域相连的代表点及其代表区域将构成一个聚类。实验结果表明，该算法能够发现任意形状、大小和密度的聚类。

关键词: 数据挖掘, 聚类, 点密度, 代表点, 密度阈值

CHEN Yuan-yuan¹,CHEN Zhi-ping^1,2. New clustering algorithm based on representatives and point density[J]. Computer Engineering and Applications, 2008, 44(28): 136-139.

陈园园¹,陈治平^1,2. 一种基于代表点和点密度的聚类算法[J]. 计算机工程与应用, 2008, 44(28): 136-139.

[1]	LAN Hong, HUANG Min. Fusion of KNN Optimized Density Peaks and FCM Clustering Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 81-88.
[2]	GUO Xiaojing, SUI Haoda. Application of Improved YOLOv3 in Foreign Object Debris Target Detection on Airfield Pavement [J]. Computer Engineering and Applications, 2021, 57(8): 249-255.
[3]	LI Li, JI Xinyuan, SONG Song. Prediction Model for Number of Software Defects in Loop [J]. Computer Engineering and Applications, 2021, 57(7): 158-163.
[4]	HUO Guangyu, ZHANG Yong, SUN Yanfeng, YIN Baocai. Research on Archive Data Intelligent Classification Based on Semantic [J]. Computer Engineering and Applications, 2021, 57(6): 247-253.
[5]	LI Jingxing, YANG Youlong. Feature Selection of Markov Blanket for High Dimensional Data [J]. Computer Engineering and Applications, 2021, 57(6): 58-66.
[6]	YANG Fang, YIN Xi, SI Jianhui, LIU Hongyuan, WANG Xue. Mathematical Expression Similarity Calculation Method Based on Focus Clustering [J]. Computer Engineering and Applications, 2021, 57(6): 88-93.
[7]	ZONG Xiaoping, TAO Zeze. Knowledge Tracing Model Based on Mastery Speed [J]. Computer Engineering and Applications, 2021, 57(6): 117-123.
[8]	ZHAO Fan, ZHANG Lin, WEN Zhiquan, YANG Linlin, LIN Guangfeng. Direct and Efficient Natural Scene Chinese Character Approaching Spotting Method [J]. Computer Engineering and Applications, 2021, 57(6): 159-167.
[9]	PENG Qihui, XUAN Shibin, GAO Qing. Distribution Automatic Threshold Density Peak Clustering Algorithm [J]. Computer Engineering and Applications, 2021, 57(5): 71-78.
[10]	LI Yongzhen, LIAO Husheng. Multi-view Clustering via Graph Convolutional Neural Network [J]. Computer Engineering and Applications, 2021, 57(5): 115-122.
[11]	WANG Changlong, ZHANG Yuandong, MIAO Hong, YANG Yuheng. Application of Double Channel Convolutional Neural Network in Pumpkin Diseases Identification [J]. Computer Engineering and Applications, 2021, 57(5): 183-189.
[12]	HU Xiaomin, WANG Mingfeng, ZHANG Shourong, LI Min. New Differential Evolution with Particle Swarm Optimization Algorithm for Text Clustering [J]. Computer Engineering and Applications, 2021, 57(4): 61-67.
[13]	WANG Junling, LU Xinming. Video Key Frame Extraction Algorithm Based on Semantic Correlation [J]. Computer Engineering and Applications, 2021, 57(4): 192-198.
[14]	WANG Fuyin, ZHANG Desheng, ZHANG Xiao. Adaptive Density Peaks Clustering Algorithm Combining with Whale Optimization Algorithm [J]. Computer Engineering and Applications, 2021, 57(3): 94-102.
[15]	GAO Tianyu, WANG Qingrong, YANG Lei. Data Mining Model Based on Attribute Dependability Enhancement of Rough Set [J]. Computer Engineering and Applications, 2021, 57(3): 87-93.

New clustering algorithm based on representatives and point density

一种基于代表点和点密度的聚类算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics