计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (24): 43-45.DOI: 10.3778/j.issn.1002-8331.2008.24.011

• 理论研究 • 上一篇    下一篇

一种基于核函数分割数据集的分类器组合算法

康 凯,张化祥,赵 斌   

  1. 山东师范大学 信息科学与工程学院,济南 250014
  • 收稿日期:2007-10-19 修回日期:2007-12-29 出版日期:2008-08-21 发布日期:2008-08-21
  • 通讯作者: 康 凯

Novel ensemble classifiers algorithm based on kernel dataset partition

KANG Kai,ZHANG Hua-xiang,ZHAO Bin   

  1. College of Information Science and Engineering,Shandong Normal University,Jinan 250014,China
  • Received:2007-10-19 Revised:2007-12-29 Online:2008-08-21 Published:2008-08-21
  • Contact: KANG Kai

摘要: 组合分类器通过在输入空间中依据一定的规则生成数据集来训练成员分类器。提出一种新的基于核函数的模糊隶属度方法用来分隔数据集,并依据数据集中样本的模糊隶属度将它们分为相对难分和相对易分的数据子集,根据两个数据子集的难易程度训练不同的分类器。并用得到的两类分类器作为成员分类器生成组合分类器。将该组合分类器应用到UCI的标准数据集,实验表明该方法比Bagging和AdaBoost算法具有更好的性能。

关键词: 模糊隶属度, 核函数, 组合分类器, 数据集分割

Abstract: The ensemble classifiers train its base classifiers using the datasets which are generated by some rules.This paper presents a fuzzy membership function based on kernel method to divide the training set into two parts,one is easy to classify while another is hard.Two different base classifiers are trained for fitting them;those two kinds of classifiers are integrated as base classifiers.This method is applied to classify the UCI benchmark datasets,and the experimental results show that this method is superior to Bagging and AdaBoost algorithms on the higher performance.

Key words: fuzzy memberships, kernel function, ensemble classifiers, dataset partition