Computer Engineering and Applications ›› 2013, Vol. 49 ›› Issue (13): 145-148.

Previous Articles     Next Articles

Fast generalized noise clustering algorithm

WU Bin1,2, WU Xiaohong3,4, JIA Hongwen2   

  1. 1.School of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China
    2.Department of Information Engineering, Chuzhou Vocational Technology College, Chuzhou, Anhui 239000, China
    3.School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China
    4.School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, China
  • Online:2013-07-01 Published:2013-06-28

一种快速的广义噪声聚类算法

武  斌1,2,武小红3,4,贾红雯2   

  1. 1.安徽农业大学 信息与计算机学院,合肥 230036
    2.滁州职业技术学院 信息工程系,安徽 滁州 239000
    3.江苏大学 电气信息工程学院,江苏 镇江 212013
    4.江苏大学 食品与生物工程学院,江苏 镇江 212013

Abstract: A Fast Generalized Noise Clustering(FGNC) based on Generalized Noise Clustering(GNC) objective function and Possibilistic Clustering Algorithm(PCA) is proposed to deal with the shortcoming of GNC algorithm depend heavily on parameters, and FCM must be performed until termination to calculate the parameters for GNC algorithm. With a nonparametric method, FGNC calculates the parameters in GNC objective function. So FGNC algorithm does not depend on the parameters that GNC holds and clusters data faster than GNC algorithm. Experiment and simulation on two man-made data sets and two real data sets show FGNC can deal with noisy data well, cluster centers are closer to real ones, clustering accuracy is improved and clustering time is reduced.

Key words: Fuzzy C-Means(FCM), Possibilistic C-Means(PCM), Generalized Noise Clustering(GNC)

摘要: 为解决广义噪声聚类(GNC)算法非常依赖参数和在运行GNC算法前必须运行FCM算法以便计算参数的缺点,在GNC的目标函数和可能聚类算法(PCA)基础上,提出一种快速的广义噪声聚类(FGNC)算法。FGNC算法通过一种非参数化方法计算GNC目标函数中的参数,因而FGNC算法不依赖参数并且聚类速度快于GNC算法。对人工含噪声数据集和两个实际数据集进行仿真实验,实验结果表明FGNC算法能很好地处理含噪声数据,具有聚类中心更接近真实聚类中心,聚类准确性高,聚类时间少的优良性能。

关键词: 模糊C-均值聚类, 可能C-均值聚类, 广义噪声聚类