计算机工程与应用 ›› 2019, Vol. 55 ›› Issue (4): 48-55.DOI: 10.3778/j.issn.1002-8331.1811-0325

• 热点与综述 • 上一篇    下一篇

基于标签特征和相关性的多标签分类算法

李  锋,杨有龙   

  1. 西安电子科技大学 数学与统计学院,西安 710126
  • 出版日期:2019-02-15 发布日期:2019-02-19

Multi-Label Classification Algorithm Based on Label-Specific Features and Label Correlation

LI Feng, YANG Youlong   

  1. School of Mathematics and Statistics, Xidian University, Xi’an 710126, China
  • Online:2019-02-15 Published:2019-02-19

摘要: 针对标签特有特征和标签相关性的有效利用,提出了一种新的多标签算法LSFLC,它可以有效地集成标签特有特征和标签相关性。首先,对于每个标签,通过重采样技术生成新的正类实例以扩充其正类实例的数目;其次,通过特征映射函数将原始特征空间转换为特定的特征空间,得到每个标签的标签特征集;然后,对于每个标签,找到与其最相关标签,通过复制该标签的正类实例来扩大标签特征集,这不仅丰富了标签的信息,而且在一定程度上改善了类不平衡的问题;最后,对于不同的数据集进行实验分析,实验结果表明该算法的分类效果更好。

关键词: 多标签学习, 局部标签相关性, 标签特有特征, 相关实例补充

Abstract: Aiming at the effective utilization of label specific features and label correlation, a new multi-label algorithm LSFLC is proposed, which can effectively integrate label specific features and label correlation. Firstly, for each label, new positive class instances are generated by re-sampling technology to expand the number of positive class instances. Secondly, the original feature space is transformed into a specific feature space by feature mapping function, and the label specific feature set of each label is obtained. Then, for each label, its most relevant label is found to expand its specific feature set by copying the positive class instances of the label, which not only enriches the information of the label, but also solves the problem of class imbalance to a certain extent. Finally, experiments on different data sets show that the classification effect of the proposed algorithm is better.

Key words: multi-label learning, local label correlation, label-specific features, related instances insertion