Transfer learning based on compact coding

doi:10.3778/j.issn.1002-8331.1608-0396

Computer Engineering and Applications, 2018, Vol. 54, Issue (3): 142-149.

SHAO Hao

Shanghai University of International Business and Economics, Shanghai 200336, China

Online: 2018-02-01 Published: 2018-02-07

Abstract

In real-world applications such as manufacturing, a new task is often related to existing tasks. Transfer learning techniques build a model for the new task by extracting useful information from existing data sets and models, which reduces the time and cost of modeling from scratch, including the high cost of acquiring labeled data for the target task. However, how to avoid negative transfer, which arises from distribution differences between tasks in a heterogeneous environment, remains an open problem. Both the similarity between tasks and the relatedness between instances need to be measured, yet most traditional methods operate at only one of these levels. A Transfer Learning method with Compact Coding (TLCC) is proposed under a two-level framework in the inductive transfer learning setting. At the macro (task) level, the similarity of each source task to the target task is represented by the code length of its class boundary, a hyperplane classifier, with respect to the target task, obtained through minimum encoding. At the micro (instance) level, informative instances of the source tasks are adaptively selected for transfer, which makes the choice of source task more accurate, improves performance, and helps avoid negative transfer. Extensive experiments on UCI and text data sets show that the proposed algorithm has a clear advantage over the compared algorithms in classification accuracy and remains accurate in noisy environments.

Key words: compact coding, classification, negative transfer, transfer learning
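
The two-level idea in the abstract can be illustrated with a minimal Python sketch. It is not the TLCC algorithm from the paper, whose encoding scheme and selection rule are not given on this page. Everything here is an assumption made for illustration: the perceptron used as the hyperplane classifier, the crude "error count plus error indices" code in `code_length`, the agreement-based filter in `select_instances`, and the toy data.

```python
import math

def train_perceptron(data, epochs=50):
    """Fit a hyperplane (w, b) with the perceptron rule; labels are +1 or -1."""
    dim = len(data[0][0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        for x, y in data:
            score = sum(wi * xi for wi, xi in zip(w, x)) + b
            if y * score <= 0:  # misclassified: move the hyperplane toward x
                w = [wi + y * xi for wi, xi in zip(w, x)]
                b += y
    return w, b

def predict(model, x):
    w, b = model
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else -1

def code_length(model, target):
    """Hypothetical code: bits for the number of errors plus one index per error.

    A source hyperplane that explains the target labels well yields a short code.
    """
    n = len(target)
    errors = sum(predict(model, x) != y for x, y in target)
    return math.log2(n + 1) + errors * math.log2(n)

def select_instances(source, target_model):
    """Hypothetical instance-level filter: keep source instances whose label
    agrees with a classifier trained on the labeled target data."""
    return [(x, y) for x, y in source if predict(target_model, x) == y]

# Toy data (assumed): a small labeled target set and two candidate source tasks.
target = [((1.0, 0.5), 1), ((2.0, -1.0), 1), ((-1.0, 0.3), -1), ((-2.0, -0.7), -1)]
source_a = [((1.5, 1.0), 1), ((0.5, -0.5), 1), ((-1.5, 0.2), -1), ((-0.5, -1.0), -1)]
source_b = [(x, -y) for x, y in source_a]  # conflicting task: labels flipped

# Task level: rank source tasks by the code length of their hyperplane on the target.
sources = {"a": source_a, "b": source_b}
lengths = {name: code_length(train_perceptron(d), target) for name, d in sources.items()}
best = min(lengths, key=lengths.get)

# Instance level: drop source instances that contradict the target data.
target_model = train_perceptron(target)
noisy_source = sources[best] + [((2.0, 0.0), -1)]  # one mislabeled instance
kept = select_instances(noisy_source, target_model)
```

In this sketch the conflicting source task gets a longer code and is ranked lower, and the mislabeled instance is filtered out before transfer. A real implementation would need the paper's actual coding scheme and selection criterion.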

SHAO Hao. Transfer learning based on compact coding[J]. Computer Engineering and Applications, 2018, 54(3): 142-149.
