计算机工程与应用 ›› 2015, Vol. 51 ›› Issue (21): 128-132.

• 数据库、数据挖掘、机器学习 • 上一篇    下一篇

基于空间重叠度的DTBSVM多类分类算法

胡小莉,陈秀宏   

  1. 江南大学 数字媒体学院,江苏 无锡 214122
  • 出版日期:2015-11-01 发布日期:2015-11-16

DTBSVM multi-class classification algorithm based on space overlapping region

HU Xiaoli, CHEN Xiuhong   

  1. School of Digital Media, Jiangnan University, Wuxi, Jiangsu 214122, China
  • Online:2015-11-01 Published:2015-11-16

摘要: 针对传统的DTBSVM算法中判断类间的可分的难易程度时可能造成的错误判断,提出了基于空间重叠度的DTSVM多类分类方法。该方法通过计算已知的类别样本在空间中的重叠度,合并有重叠的类,组合为一个新的类,再基于一种有效的类间可分性准则进行划分,使得容易划分的类能从决策树的根节点开始逐层分割出来,再划分有类间重叠的类,这样就可以尽量地避免“误差累积”的风险,构造出分类效果好的决策树结构。实验结果表明,该方法大大提升了DTSVM多类分类算法的分类正确率。

关键词: 支持向量机, 决策树, 空间重叠度, 类间可分性, 多类分类

Abstract: For the traditional DTBSVM algorithm may have the misjudgement of the difficulty to separate the classes, a method which is based on the space overlapping region DTSVM multi-classification is proposed. The method will compute the space overlapping region by the labeled samples, then merges those classes who have the overlapped regions, and combines them into a new class, then uses an effective extra-class measurement of separability to devide the new labeled classes, to make sure that those classes who are easy to be partitioned will be separated from the root of the decision tree. It separates those classes who have overlapping regions, if do like this, the algorithm will avoid the risk of the cumulative of error successfully, and constructs better classification results decision tree. The experiment shows that the method will enhance the accuracy of DTBSVM multi-classification.

Key words: support vector machines, decision tree, space overlapping region, extra-class measurement of separability, multi-classification