Text Classification Method Based on LSTM-Attention and CNN Hybrid Model

doi:10.3778/j.issn.1002-8331.2011-0037

Abstract

Abstract:

For the problem that traditional Long Short-Term Memory（LSTM） and Convolution Neural Network（CNN） cannot reflect the importance of each word in the text when extracting features, a text classification method based on the hybrid model of LSTM-Attention and CNN is proposed. Firstly, CNN is used to extract the local information of the text and then integrate the semantics of the whole text. Secondly, LSTM is used to extract text context features. After LSTM, Attention mechanism is added to extract the Attention score of output information. Finally, the output of LSTM-Attention is fused with the output of CNN, so as to realize the effective extraction of text features and focus Attention on important words. The experimental results on three open data sets show that the proposed model is more effective than LSTM, CNN and their improved models, and can effectively improve the effect of text classification.

Key words: text classification, Long Short-Term Memory（LSTM）, attention mechanism, Convolution Neural Network（CNN）, feature fusion

摘要：

针对传统长短时记忆网络（Long Short-Term Memory，LSTM）和卷积神经网络（Convolution Neural Network，CNN）在提取特征时无法体现每个词语在文本中重要程度的问题，提出一种基于LSTM-Attention与CNN混合模型的文本分类方法。使用CNN提取文本局部信息，进而整合出全文语义；用LSTM提取文本上下文特征，在LSTM之后加入注意力机制（Attention）提取输出信息的注意力分值；将LSTM-Attention的输出与CNN的输出进行融合，实现了有效提取文本特征的基础上将注意力集中在重要的词语上。在三个公开数据集上的实验结果表明，提出的模型相较于LSTM、CNN及其改进模型效果更好，可以有效提高文本分类的效果。

关键词: 文本分类, 长短时记忆网络（LSTM）, 注意力机制, 卷积神经网络（CNN）, 特征融合

TENG Jinbao, KONG Weiwei, TIAN Qiaoxin, WANG Zhaoqian. Text Classification Method Based on LSTM-Attention and CNN Hybrid Model[J]. Computer Engineering and Applications, 2021, 57(14): 126-133.

滕金保，孔韦韦，田乔鑫，王照乾. 基于LSTM-Attention与CNN混合模型的文本分类方法[J]. 计算机工程与应用, 2021, 57(14): 126-133.

[1]	XU Hao, ZHANG Kai, TIAN Yingjie, CHONG Faguang, WANG Zichao. Review of Deep Neural Network-Based Image Caption [J]. Computer Engineering and Applications, 2021, 57(9): 9-22.
[2]	ZHANG Zhentong, SHAN Yugang, YUAN Jie. Remote Sensing Image Detection Algorithm Combining Multi-scale and Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 212-216.
[3]	LU Lixia, ZOU Junzhong, GUO Yucheng, ZHANG Jian, WANG Bei. Prediction of Knee Injury Based on Multimodal Fusion [J]. Computer Engineering and Applications, 2021, 57(9): 225-232.
[4]	LI Zhenxiao, SUN Wei, LIU Mingming, ZHENG Lili, CHEN Shaoying. Research on Vehicle Detection and Tracking Algorithms in Traffic Monitoring Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 103-111.
[5]	ZHAO Yuanli, LIANG Zhijian. Research on Stance Detection Based on Dual Attention Mechanism of Heteronuclear Convolution [J]. Computer Engineering and Applications, 2021, 57(8): 119-125.
[6]	ZHANG Yue, HUANG Yourui, LIU Pengkun. Research on Multi-resolution Human Pose Estimation with Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 126-132.
[7]	WANG Ling, WANG Jiapei, WANG Peng, SUN Shuangzi. Siamese Network Tracking Algorithms for Hierarchical Fusion of Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 169-174.
[8]	LI Mingshan, HAN Qingpeng, ZHANG Tianyu, WANG Daolei. Safety Helmet Detection Method of Improved SSD [J]. Computer Engineering and Applications, 2021, 57(8): 192-197.
[9]	GUO Xiaojing, SUI Haoda. Application of Improved YOLOv3 in Foreign Object Debris Target Detection on Airfield Pavement [J]. Computer Engineering and Applications, 2021, 57(8): 249-255.
[10]	CHEN Wei, XU Yun. Research on Extraction of Biomedical Entity Relation Based on Literature Mining [J]. Computer Engineering and Applications, 2021, 57(7): 115-120.
[11]	YANG Qian, GU Lei. Chinese Named Entity Recognition Based on Denoising Joint Character-Word Model [J]. Computer Engineering and Applications, 2021, 57(7): 151-157.
[12]	LI Xianguo, FENG Xinxin, LI Jianxiong. Sigle Image Super-Resolution Reconstruction Based on Multi-scale Residual Network [J]. Computer Engineering and Applications, 2021, 57(7): 215-221.
[13]	YANG Bo, TAO Qingchuan, DONG Peijun. Surgical Instrument Segmentation Method Based on Improved Deeplab v3+ Network [J]. Computer Engineering and Applications, 2021, 57(7): 222-227.
[14]	HUANG Jinjie, LIN Jiangquan, HE Yongjun, HE Jinjie, WANG Yajun. Chinese Short Text Classification Algorithm Based on Local Semantics and Context [J]. Computer Engineering and Applications, 2021, 57(6): 94-100.
[15]	ZHANG Rui, WU Boxiong, ZHANG Liyuan, ZHANG Bo. Human Trajectory Prediction Method for Complex Scenes [J]. Computer Engineering and Applications, 2021, 57(6): 138-143.

Text Classification Method Based on LSTM-Attention and CNN Hybrid Model

基于LSTM-Attention与CNN混合模型的文本分类方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics