计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (10): 147-150.DOI: 10.3778/j.issn.1002-8331.2009.10.044

• 数据库、信号与信息处理 • 上一篇    下一篇

数据聚类分析新方法研究

廖志芳,李 鹏,刘克准,樊晓平,瞿志华   

  1. 中南大学 信息科学与工程学院,长沙 410075
  • 收稿日期:2008-02-20 修回日期:2008-04-28 出版日期:2009-04-01 发布日期:2009-04-01
  • 通讯作者: 廖志芳

New clustering algorithm based on lattice

LIAO Zhi-fang,LI Peng,LIU Ke-zhun,FAN Xiao-ping,QU Zhi-hua   

  1. School of Information Science and Engineering,Central South University,Changsha 410075,China
  • Received:2008-02-20 Revised:2008-04-28 Online:2009-04-01 Published:2009-04-01
  • Contact: LIAO Zhi-fang

摘要: 信息技术不断的进步,现实世界中需要处理的数据已由单一的数值型逐渐转变成由数值、文本、符号等类型构成的混合型数据。与现存大量的面向数值型数据的聚类算法相比,能有效处理混合型数据的聚类算法相对较少。为此,在格论基础上提出了一种适用于混合数据的聚类算法,该算法根据对象间格的覆盖数量来度量相似度,根据高覆盖数高相似度的原则选择聚类中心进行聚类。实验结果表明与其他传统聚类算法相比,新算法在不增加空间复杂度的情况下有效地提高了聚类的质量。

Abstract: With the development of information technology,mixed data should be processed which changes from the simple numerical or categorical to mixed value.Compared with clustering methods for pure numerical data,the methods can process mixed numerical effectively are very few.This paper presents a new algorithm based on lattice for multiple-attribute which uses lattice covers to calculate the similarity between two objects.In the algorithm,each sample will be distributed to the cluster where it gets the maximum cover count number.Experiment shows that the new algorithm is more efficient than the other classical algorithms.