Computer Engineering and Applications ›› 2021, Vol. 57 ›› Issue (6): 94-100. DOI: 10.3778/j.issn.1002-8331.1912-0185

• Pattern Recognition and Artificial Intelligence •


Chinese Short Text Classification Algorithm Based on Local Semantics and Context

HUANG Jinjie, LIN Jiangquan, HE Yongjun, HE Jinjie, WANG Yajun   

1. School of Automation, Harbin University of Science and Technology, Harbin 150080, China
    2. School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China
  • Online: 2021-03-15  Published: 2021-03-12


Abstract:

Short texts are usually composed of several to a few dozen words; their short length and sparse features make it difficult to improve the accuracy of short text classification. To address this problem, a classification algorithm for Chinese short texts based on the fusion of local semantic features and context relationships, called Bi-LSTM_CNN_AT, is proposed. In this algorithm, a CNN is used to extract the local semantic features of a text, while a Bi-LSTM extracts its contextual semantic features; an attention mechanism is further incorporated so that the Bi-LSTM_CNN_AT model can select, from the many extracted features, those most relevant to the current task and thus classify texts more accurately. Experimental results show that the Bi-LSTM_CNN_AT model achieves a classification accuracy of 81.31% over the 18 categories of the NLP&CC2017 news headline classification dataset, which is 2.02% higher than the single-channel CNN model and 1.77% higher than the single-channel Bi-LSTM model.
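As a rough illustration of the architecture described in the abstract, the following PyTorch sketch wires together a CNN channel for local n-gram features, a Bi-LSTM channel for contextual features, and an additive attention layer over the Bi-LSTM states, fusing the two channels by concatenation before classification. It is a minimal sketch under assumed settings (embedding size, filter widths, hidden dimension, and the exact attention/fusion scheme are assumptions), not the authors' published implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BiLSTM_CNN_AT(nn.Module):
    """Sketch of a dual-channel short-text classifier: a CNN channel for
    local semantic (n-gram) features, a Bi-LSTM channel for contextual
    features, and additive attention over the Bi-LSTM hidden states."""

    def __init__(self, vocab_size, embed_dim=300, hidden_dim=128,
                 num_filters=128, kernel_sizes=(2, 3, 4), num_classes=18):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # CNN channel: one convolution per kernel width, max-pooled over time
        self.convs = nn.ModuleList(
            nn.Conv1d(embed_dim, num_filters, k) for k in kernel_sizes)
        # Bi-LSTM channel over the same embedded sequence
        self.bilstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                              bidirectional=True)
        # Additive attention: one score per time step
        self.att = nn.Linear(2 * hidden_dim, 1)
        self.fc = nn.Linear(num_filters * len(kernel_sizes) + 2 * hidden_dim,
                            num_classes)

    def forward(self, x):                        # x: (batch, seq_len) word ids
        emb = self.embedding(x)                  # (batch, seq_len, embed_dim)
        # CNN channel: local semantic features
        c = emb.transpose(1, 2)                  # (batch, embed_dim, seq_len)
        cnn_feat = torch.cat(
            [F.relu(conv(c)).max(dim=2).values for conv in self.convs], dim=1)
        # Bi-LSTM channel: contextual semantic features
        h, _ = self.bilstm(emb)                  # (batch, seq_len, 2*hidden_dim)
        # Attention weights over time steps, then weighted sum of states
        a = torch.softmax(self.att(h).squeeze(-1), dim=1)    # (batch, seq_len)
        ctx_feat = torch.bmm(a.unsqueeze(1), h).squeeze(1)   # (batch, 2*hidden_dim)
        # Fuse both channels and classify into the 18 categories
        return self.fc(torch.cat([cnn_feat, ctx_feat], dim=1))

# Example forward pass with a hypothetical vocabulary of 50,000 words
# logits = BiLSTM_CNN_AT(vocab_size=50000)(torch.randint(1, 50000, (4, 20)))
```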

Key words: short text classification, convolutional neural network, bidirectional long short-term memory network, attention mechanism