计算机工程与应用 ›› 2010, Vol. 46 ›› Issue (34): 115-118.DOI: 10.3778/j.issn.1002-8331.2010.34.035

• 数据库、信号与信息处理 • 上一篇    下一篇

决策树属性选择标准的改进

谢妞妞,刘於勋   

  1. 河南工业大学 信息科学与工程学院,郑州 450001
  • 收稿日期:2010-05-05 修回日期:2010-09-25 出版日期:2010-12-01 发布日期:2010-12-01
  • 通讯作者: 谢妞妞

Improvement of attribute selection criterion of decision trees

XIE Niu-niu,LIU Yu-xun   

  1. College of Information Science and Engineering,Henan University of Technology,Zhengzhou 450001,China
  • Received:2010-05-05 Revised:2010-09-25 Online:2010-12-01 Published:2010-12-01
  • Contact: XIE Niu-niu

摘要: 决策树算法是数据挖掘领域的一个研究热点,通常用来形成分类器和预测模型,在实际中有着广泛的应用。重点阐述了经典的ID3决策树算法,分析了它的优缺点,结合泰勒公式和麦克劳林公式提出了新的属性选择标准。改进后的算法通过简化信息熵的计算,提高了分类准确度,缩短了决策树的生成时间,减少了计算成本。实验证明,改进后算法的有效性和正确性。

Abstract: The decision tree algorithm is a research hotspot in the field of data mining,which is usually used form classifiers and prediction models.In practice,it is widely used.This paper focuses on classical ID3 algorithm,analyzes its advantages and disadvantages,combines with Taylor and Maclaurin formula,and then puts forward a new attribute selection criterion of decision trees.This modified algorithm improves the classification accuracy,reduces the generation time of decision trees,and shortens the computational cost by the calculation of the simplified information entropy.Experimental results show effectiveness and correctness of the improved algorithm.

中图分类号: