Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (30): 129-131.DOI: 10.3778/j.issn.1002-8331.2009.30.040

• 数据库、信号与信息处理 • Previous Articles     Next Articles

Improve KNN algorithm based on entropy method

WANG Zeng-min,WANG Kai-jue   

  1. School of Economics and Management,Beijing University of Posts and Telecommunications,Beijing 100876,China
  • Received:2009-04-27 Revised:2009-06-29 Online:2009-10-21 Published:2009-10-21
  • Contact: WANG Zeng-min

基于熵权的K最临近算法改进

王增民,王开珏   

  1. 北京邮电大学 经济管理学院,北京 100876
  • 通讯作者: 王增民

Abstract: Dimension disaster in the classified function will directly impact on the efficiency and the rate of accuracy of K-Nearest Neighbor algorithm(KNN),this paper combines information entropy theory used to reduce attributes and KNN algorithm,and also determines weight of characteristic attributes based on the degree of correlation between characteristic attributes and target attribute to establish the intrinsic relation between the degree of correlation and weight.The simulation experiment shows that compared with traditional KNN algorithm the improved KNN algorithm based on entropy method raises the rate of accuracy enormously in classification,meanwhile it also maintains efficiency of classification.

Key words: K-Nearest Neighbor(KNN) Algorithm, entropy weight, attributes reduction, classification

摘要: 维度灾难直接影响到K最临近算法(KNN)的效率和准确率,将信息论中的信息熵理论与KNN算法结合起来,用信息熵理论进行属性约简,并根据特征属性与分类的相关度来确定各属性的权限,从而建立相关度与权重的内在联系。仿真实验表明,与传统的KNN相比,基于熵权的KNN改进方法在保持分类效率的情况下,使分类器的准确率得到了极大的提高。

关键词: K最邻近算法, 熵权, 属性约简, 分类

CLC Number: