Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (24): 40-42.DOI: 10.3778/j.issn.1002-8331.2009.24.013
• 研究、探讨 • Previous Articles Next Articles
WANG Jie,JIANG Guo-qiang
Received:
Revised:
Online:
Published:
Contact:
王 杰,姜国强
郑州大学 电气工程学院,郑州 450001
通讯作者:
Abstract: Discretization of continuous attribute is an important pretreatment process in the knowledge discovery study.A novel algorithm of supervised discretization of continuous attributes based on nearest neighbor-clustering and the theories of rough set is introduced.The algorithm is dealt with in two stages,in the first stage,the number of clusters is adjusted dynamically by using the nearest neighbour-clustering,and then the initial clusters are determined.In the second stage,the adjacent region is merged by adopting the definition of merging similar range,and then the clusters are reduced.The algorithm is proved effectively by the case analysis.
Key words: discretization, nearest neighbor-clustering, rough set, approximate classification quality
摘要: 连续属性离散化是知识发现研究中重要的预处理过程,基于最近邻聚类和粗集的相关理论,提出一种新的有监督的多属性离散化方法。该算法分两个阶段来处理,首先利用最近邻聚类动态调整聚类的类别数,生成初始聚类。然后基于类信息的相似性定义合并相似区间,减少了聚类区间。通过实例分析,该算法是非常有效的。
关键词: 离散化, 最近邻聚类, 粗集, 近似分类质量
CLC Number:
TP301.6
WANG Jie,JIANG Guo-qiang. Novel attribute discretization algorithm based on nearest neighbor-clustering[J]. Computer Engineering and Applications, 2009, 45(24): 40-42.
王 杰,姜国强. 新的基于最近邻聚类的属性离散化算法[J]. 计算机工程与应用, 2009, 45(24): 40-42.
0 / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.2009.24.013
http://cea.ceaj.org/EN/Y2009/V45/I24/40