Computer Engineering and Applications ›› 2020, Vol. 56 ›› Issue (7): 155-161. DOI: 10.3778/j.issn.1002-8331.1812-0188

• Pattern Recognition and Artificial Intelligence •


Incremental Learning Algorithm Based on Heterogeneous Classifier Ensemble

XIONG Lin, TANG Wanmei   

  1. College of Computer and Information Science, Chongqing Normal University, Chongqing 401331, China
  • Online:2020-04-01 Published:2020-03-28


Abstract:

Introducing the idea of ensemble learning into incremental learning can significantly improve learning performance. In recent years, most research on ensemble-based incremental learning has combined multiple homogeneous classifiers through weighted voting, which does not adequately address the stability-plasticity dilemma of incremental learning. To this end, an incremental learning algorithm based on a heterogeneous classifier ensemble is proposed. During training, to make the model more stable, several base classifiers are trained on the new data and added to the heterogeneous ensemble, while a Locality-Sensitive Hashing (LSH) table stores a sketch of the data for nearest-neighbor lookup of test samples. To adapt to changing data, newly acquired data are also used to update the voting weights of the base classifiers in the ensemble. When predicting the class of a test sample, the data in the LSH table similar to that sample serve as a bridge for computing each base classifier's dynamic weight with respect to the sample; the class label is then determined by combining the voting weights and dynamic weights of the base classifiers. Comparative experiments show that the proposed incremental algorithm has high stability and good generalization ability.
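The prediction mechanism described above can be sketched in a few lines: a random-hyperplane LSH table stores training points, the bucket matching a test sample supplies its neighbours, each base classifier's accuracy on those neighbours serves as its dynamic weight, and the final vote multiplies that by the classifier's voting weight. This is a minimal illustrative sketch, not the authors' implementation; all names (`LSHTable`, `dynamic_weight`, `predict_ensemble`) and the choice of random-hyperplane hashing and accuracy-based dynamic weights are assumptions for illustration.

```python
import numpy as np
from collections import defaultdict

class LSHTable:
    """Toy random-hyperplane LSH: points with the same sign pattern
    against a set of random planes land in the same bucket."""
    def __init__(self, dim, n_planes=8, seed=0):
        rng = np.random.default_rng(seed)
        self.planes = rng.normal(size=(n_planes, dim))
        self.buckets = defaultdict(list)  # hash key -> list of (x, label)

    def _hash(self, x):
        return tuple((self.planes @ x > 0).astype(int))

    def add(self, X, y):
        for xi, yi in zip(X, y):
            self.buckets[self._hash(xi)].append((xi, yi))

    def neighbours(self, x):
        return self.buckets.get(self._hash(x), [])

def dynamic_weight(clf, neighbours):
    """One classifier's accuracy on the test sample's LSH neighbours."""
    if not neighbours:
        return 1.0  # no local evidence: fall back to the voting weight alone
    X = np.array([n[0] for n in neighbours])
    y = np.array([n[1] for n in neighbours])
    return float((clf.predict(X) == y).mean())

def predict_ensemble(classifiers, voting_w, table, x, classes):
    """Combine each classifier's voting weight with its dynamic weight."""
    nbrs = table.neighbours(x)
    scores = {c: 0.0 for c in classes}
    for clf, vw in zip(classifiers, voting_w):
        label = clf.predict(x.reshape(1, -1))[0]
        scores[label] += vw * dynamic_weight(clf, nbrs)
    return max(scores, key=scores.get)
```

Because the dynamic weight is recomputed per test sample from its own neighbourhood, a classifier that is globally mediocre but locally accurate can still dominate the vote for that sample, which is the intended effect of the dynamic-weighting step.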

Key words: incremental learning, ensemble learning, Locality-Sensitive Hashing(LSH), heterogeneous classifier ensemble, dynamic weight