Dynamic Multi-label Text Classification Algorithm Based on Label Semantic Similarity

doi:10.3778/j.issn.1002-8331.1911-0033

Abstract

Abstract:

To solve the problem of dynamic multi-label text classification with time-varying labels, a dynamic multi label text classification algorithm based on label semantic similarity is proposed. In the training phase, a multi-label text classifier based on convolutional neural network is trained, and then the output of the penultimate layer of the classifier is taken as the feature vector of the text. Because the feature vector is trained with labels, it contains label semantic information compared with the content-based feature vector. In the test phase, the test document is input into the multi label text classifier in the training phase to obtain the corresponding feature vector, and then the cosine similarity is calculated. At the same time, a time attenuation factor is added to make the recent text have a higher similarity value. Finally, the nearest neighbor algorithm is used for classification. The experimental results show that the proposed algorithm has better performance in dealing with dynamic multi-label text classification problem.

Key words: dynamic multi-label, text classification, neural networks, label semantic similarity

摘要：

针对标签随着时间变化的动态多标签文本分类问题，提出了一种基于标签语义相似的动态多标签文本分类算法。该算法在训练阶段，首先按照标签固定训练得到一个基于卷积神经网络的多标签文本分类器，然后以该分类器的倒数第二层的输出为文本的特征向量。由于该特征向量是在有标签训练得到的，因而相对于基于字符串即文本内容而言，该特征向量含有标签语义信息。在测试阶段，将测试文档输入训练阶段的多标签文本分类器获取相应的特征向量，然后计算相似性，同时乘以时间衰减因子修正，使得时间越近的文本具有较高的相似性。最后，采用最近邻算法分类。实验结果表明，该算法在处理动态多标签文本分类问题上具有较优的性能。

关键词: 动态多标签, 文本分类, 神经网络, 标签语义相似

YAO Jiaqi, XU Zhengguo, YAN Jikun, XIONG Gang, LI Zhixiang. Dynamic Multi-label Text Classification Algorithm Based on Label Semantic Similarity[J]. Computer Engineering and Applications, 2020, 56(19): 94-98.

姚佳奇，徐正国，燕继坤，熊钢，李智翔. 基于标签语义相似的动态多标签文本分类算法[J]. 计算机工程与应用, 2020, 56(19): 94-98.

[1]	RAN Rong, XU Xinghua, QIU Shaohua, CUI Xiaopeng, OUYANG Bin. Review of Crack Detection Methods Based on Deep Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(9): 23-35.
[2]	MOU Qingping, ZHANG Ying, ZHANG Dongbo, WANG Xinjie, YANG Zhiqiao. Research on Visual Tracking Algorithm and Application of Target Loss Discrimination Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 140-147.
[3]	HUANG Dongyi, YANG Bing, WU Zihao, KUANG Jiayi, YAN Zeming. Spatio-Temporal Fully Connected Convolutional Neural Networks for Citywide Cellular Prediction [J]. Computer Engineering and Applications, 2021, 57(9): 168-175.
[4]	MA Zhexu, YANG Feng, QIAO Xu. Intelligent Detection Method of Railway Subgrade Defect [J]. Computer Engineering and Applications, 2021, 57(9): 272-278.
[5]	YANG Peiwei, ZHOU Yuhong, XING Gang, TIAN Zhiqiang, XU Xiayu. Applications of Convolutional Neural Network in Biomedical Image [J]. Computer Engineering and Applications, 2021, 57(7): 44-58.
[6]	HUO Guangyu, ZHANG Yong, SUN Yanfeng, YIN Baocai. Research on Archive Data Intelligent Classification Based on Semantic [J]. Computer Engineering and Applications, 2021, 57(6): 247-253.
[7]	HUANG Jinjie, LIN Jiangquan, HE Yongjun, HE Jinjie, WANG Yajun. Chinese Short Text Classification Algorithm Based on Local Semantics and Context [J]. Computer Engineering and Applications, 2021, 57(6): 94-100.
[8]	ZHENG Cheng, DONG Chunyang, HUANG Xiayan. Short Text Classification Method Based on BTM Graph Convolutional Network [J]. Computer Engineering and Applications, 2021, 57(4): 155-160.
[9]	LI Junxia, ZHANG Qin, ZHENG Guimei. Overview of Human Posture Recognition by Ultra-wideband Radar [J]. Computer Engineering and Applications, 2021, 57(3): 14-23.
[10]	HE Wenliang, ZHU Minling. Research Status and Future Analysis of Capsule Neural Network [J]. Computer Engineering and Applications, 2021, 57(3): 33-43.
[11]	MA Jinlin, ZHU Yanbin, MA Ziping, GONG Yuanwen, CHEN Deguang, LIU Yuhao. Review of Deep Learning Methods for Lip Recognition [J]. Computer Engineering and Applications, 2021, 57(24): 61-73.
[12]	TENG Jinbao, KONG Weiwei, TIAN Qiaoxin, WANG Zhaoqian, LI Long. Multi-channel Attention Mechanism Text Classification Model Based on CNN and LSTM [J]. Computer Engineering and Applications, 2021, 57(23): 154-162.
[13]	WU Shuzhao, LI Gongquan, BU Mingwei. Construction of Question Answering System for Suicide Tendency Detection Based on Knowledge Graph [J]. Computer Engineering and Applications, 2021, 57(22): 304-312.
[14]	LI Duo, DONG Chaoqun, SI Pinchao, HE Man, LIU Qianchao. Survey of Research on Neural Network Verification and Testing Technology [J]. Computer Engineering and Applications, 2021, 57(22): 53-67.
[15]	ZHU Meng, MIN Weidong, ZHANG Yu, DUAN Jingwen. Parallel Selective Kernel Attention Based on HardSoftmax [J]. Computer Engineering and Applications, 2021, 57(21): 95-101.

Dynamic Multi-label Text Classification Algorithm Based on Label Semantic Similarity

基于标签语义相似的动态多标签文本分类算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics