计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (34): 17-22.

• 博士论坛 • 上一篇    下一篇

集成学习算法的研究与应用

侯  勇1,2,郑雪峰1   

  1. 1.北京科技大学 计算机与通信工程学院,北京 100083 
    2.山东经贸职业学院 科学与人文学院,山东 潍坊 261011
  • 出版日期:2012-12-01 发布日期:2012-11-30

Study of ensemble algorithm and its application

HOU Yong 1,2, ZHENG Xuefeng1   

  1. 1.School of Computer & Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China
    2.College of Humanities and Science, Shandong Vocational College of Economics and Business, Weifang, Shandong 261011, China
  • Online:2012-12-01 Published:2012-11-30

摘要: 集成学习算法的思想就是集成多个学习器,并组合它们的预测结果,以形成最终的结论。典型的学习模型组合方法有投票法,专家混合方法,堆叠泛化法与级联法,但这些方法的性能都有待进一步提高。提出了一种新颖的集成学习算法——增强的集成学习算法(ReinforcedEnsemble)。ReinforcedEnsemble集成算法由两大部分组成:ReinforcedEnsemble特征提取算法与ReinforcedEnsemble基分类器。通过实验,将ReinforcedEnsemble算法与其他集成学习算法进行了性能比较。实验结果表明,所提出的算法在多项指标上均达到最优。

关键词: 特征提取, 最大间隔, 多层感知器, 集成算法, KDDCUP99数据集, 入侵检测

Abstract: The idea of ensemble learning is to employ multiple learners and combine their predictions. The typical methods of combining multiple models such as bagging, boosting, stacking error correcting output codes, voting, mixtures of experts, stacked generalization and cascading. Though a considerable effort has been put into developing statistical models and algorithmic strategies for classification, the accurate of the classification has been proven to be very challenging. A novel ensemble algorithm, ReinforcedEnsemble is proposed. ReinforcedEnsemble ensemble algorithm consists of two parts, ReinforcedEnsemble feature extraction algorithm and ReinforcedEnsemble base classifier. The performance between ReinforcedEnsemble and other ensemble algorithm in the experiments is compared. The experimental results show that the proposed algorithm is optimal in a number of indicators.

Key words: feature extraction, maximum margin, multilayer perceptron, assemble algorithm, KDDCUP99 data set, intrusion detection