计算机工程与应用 ›› 2017, Vol. 53 ›› Issue (15): 47-56.DOI: 10.3778/j.issn.1002-8331.1603-0450

• 理论与研发 • 上一篇    下一篇

基于逻辑回归的多任务域快速分类学习算法

顾  鑫1,2,曹丹华1,吴裕斌1,栾永昕2,王伟成3   

  1. 1.华中科技大学 光学与电子信息学院,武汉 430074
    2.江苏北方湖光光电有限公司,江苏 无锡 214035
    3.南京大学 软件学院,江苏 无锡 210000
  • 出版日期:2017-08-01 发布日期:2017-08-14

Multi-task coupled logistic regression and its fast implementation for large multi-task datasets

GU Xin1,2, CAO Danhua1, WU Yubin1, LUAN Yongxin2, WANG Weicheng3   

  1. 1.School of Optical and Electronic Information, Huazhong University of Science and Technology, Wuhan 430074, China
    2.Jiangsu North Huguang Opto-Electronics Co. Ltd. , Wuxi, Jiangsu 214035, China
    3.Software Institute, Nanjing University, Wuxi, Jiangsu 210000, China
  • Online:2017-08-01 Published:2017-08-14

摘要: 多任务学习通过寻找并共享不同任务域之间的共性特征来完成学习,利用知识迁移加速不同任务域的学习为每个任务域构建一个分类器。提出了一种基于罗杰斯特回归模型的多任务学习方法MTC-LR(Multi-task Coupled Logistic Regression)。“罗杰斯特回归模型”已经被成功应用于单任务分类器上,该模型被众多实验证明是有效的,正是这种方法给人们带来了启示。从理论上证明了通过构造多任务分类器的“开销函数”和“差异性度量函数”,MTC-LR算法可以提高多任务分类器的各自分类精度。相比传统的基于SVM的多任务学习方法,MTC-LR并不依赖于核方法而是通过共轭梯度下降法寻找各个分类器的最优参数。同时MTC-LR与采用“罗杰斯特回归模型”的快速算法CDdual更容易结合,可扩展至大样本的多任务分类学习。正是基于上述发现,为了充分高效利用大样本的多任务域数据,满足大样本的快速运算,在MTC-LR算法的基础上,结合最新的CDdual(The Dual Coordinate Descent Method)算法,提出了MTC-LR的快速算法MTC-LR-CDdual,并对该算法进行了相关的理论分析。将该算法在人工数据集和真实数据集上进行了验证,实验结果表明该算法有着较高的识别率、快速的识别速度和较好的鲁棒性。

关键词: 多任务分类, 罗杰斯特回归, 后验概率, 对偶坐标下降法

Abstract: When facing multi-task learning problems, it is desirable that the learning method can find the correct input-output features and share the commonality among multiple domains and also scale up for large multi-task datasets. This paper introduces the multi-task coupled logistic regression framework called MTC-LR, which is a new method for generating each classifier for each task, capable of sharing the commonality among multi-task domains. The basic idea of MTC-LR is to use all individual logistic regression based classifiers, each one appropriate for each task domain, but in contrast to other SVM based proposals, learning all the parameter vectors of all individual classifiers by using the conjugate gradient method, in a global way and without the use of kernel trick, and being easily extended into its scaled version. This paper theoretically shows that the addition of a new term in the cost function of the set of logistic regressions(that penalizes the diversity among multiple tasks) produces a coupling of multiple tasks that allows MTC-LR to improve the learning performance in a logistic-regression way. This finding can make us easily integrate it with a state-of-the-art fast logistic regression algorithm called CDdual to develop its fast version MTC-LR-CDdual for large multi-task datasets. The proposed algorithm MTC-LR-CDdual is also theoretically analyzed. The experimental results on artificial and real datasets indicate the effectiveness of the proposed algorithm MTC-LR-CDdual in classification accuracy, speed and robustness.

Key words: multi-task classification learning, logistic regression, posterior probability, dual coordinate descent method