计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (2): 121-123.DOI: 10.3778/j.issn.1002-8331.2011.02.038

• 数据库、信号与信息处理 • 上一篇    下一篇

扩散模式的聚类算法研究

黄俊恒1,孙玉山2,朱东杰2   

  1. 1.哈尔滨工业大学(威海) 计算机与科学技术学院,山东 威海 264209
    2.哈尔滨工业大学(威海) 软件学院,山东 威海 264209
  • 收稿日期:2009-05-18 修回日期:2009-07-08 出版日期:2011-01-11 发布日期:2011-01-11
  • 通讯作者: 黄俊恒

Research of clustering algorithm based on diffusion model

HUANG Junheng1,SUN Yushan2,ZHU Dongjie2   

  1. 1.Department of Computer Science & Technology,Harbin Institute of Technology,Weihai,Shandong 264209,China
    2.School of Software,Harbin Institute of Technology,Weihai,Shandong 264209,China
  • Received:2009-05-18 Revised:2009-07-08 Online:2011-01-11 Published:2011-01-11
  • Contact: HUANG Junheng

摘要: 针对各种扩散模式数据点分布的聚类问题,提出了一种基于密度变化的聚类算法(CDD)。CDD采用基于密度的典型聚类算法(DBSCAN)寻找核心点,通过分析数据样本及其周围点密度的扩散规律,计算密度扩散的方向、速度和加速度,对数据样本进行聚类。实验结果表明:与DBSCAN相比,能准确对扩散模式数据进行聚类,对非扩散模式数据具有抗噪声干扰能力强,参数较易确定的优点。

关键词: 聚类, 数据挖掘, 扩散模式

Abstract: Aiming at the various distribute clustering problems in diffusion model for all data points,a new clustering algorithm(CDD) based on the change of density is proposed.CDD searches the core point using a typical clustering algorithm(DBSCAN) based on the density,it calculates the direction,speed and acceleration of density diffused through analyzing the diffusion rule of data sample and its around the point’ density,then completes the sample points’ clustering.The experimental results show that compared with DBSCAN,CDD can cluster the diffusion model accurately,and has strong anti-noise-interference ability for the non-diffusion model which makes it easier to determine the merits of the parameters.

Key words: clustering, data mining, diffusion model

中图分类号: