Algorithm of Text Similarity Analysis Based on Capsule-BiGRU

doi:10.3778/j.issn.1002-8331.2004-0253

Abstract

Abstract:

Aiming at the problem that the traditional neural network model cannot extract the features of the text well, a text similarity analysis method based on capsule-BiGRU is proposed. The local features matrix of the text extracted by the capsule network and the global features matrix of the text extracted by the BiGRU are analyzed for similarity separately to obtain the similarity matrix of the text, to judge the similarity of text. The traditional capsule network is improved, words that have nothing to do with text semantics are regarded as noise capsules, and smaller weights are assigned to reduce the impact on subsequent tasks. For the task of text similarity, a co-attention?mechanism is added before feature extraction. For two texts to be analyzed, weights are given by calculating the similarity between words in one text and all words in another text, so that determine the similarity of text more accurately. Experiment with the Quora Questions Pairs dataset. The experimental results show that the proposed method has an accuracy rate of 86.16% and an F1 value of 88.77%, which is better than other methods.

Key words: text similarity, capsule, BiGRU, attention mechanism

摘要：

针对传统神经网络模型不能很好地提取文本特征的问题，提出基于capsule-BiGRU的文本相似度分析方法，该方法将胶囊网络（capsule）提取的文本的局部特征矩阵和双向门控循环单元网络（BiGRU）提取的文本的全局特征矩阵分别进行相似度分析，得到文本的相似度矩阵，将相似度矩阵融合，得到两个文本的多层次相似度向量，从而进行文本相似度的判定。将传统的胶囊网络进行改进，把与文本语义无关的单词视为噪声胶囊，赋予较小权值，从而减轻对后续任务的影响。针对文本相似度的任务，在文本特征矩阵提取前加入互注意力机制，对于待分析的两个文本，通过计算一个文本中单词与另一文本中所有单词的相似度来对词向量赋予权值，从而能更准确地判断文本的相似度。在Quora Questions Pairs数据集进行实验，实验结果表明所提出的方法准确率为86.16%，F1值为88.77%，结果优于其他方法。

关键词: 文本相似度, 胶囊网络, 双向门控循环单元网络, 注意力机制

ZHAO Qi, DU Yanhui, LU Tianliang, SHEN Shaoyu. Algorithm of Text Similarity Analysis Based on Capsule-BiGRU[J]. Computer Engineering and Applications, 2021, 57(15): 171-177.

赵琪，杜彦辉，芦天亮，沈少禹. 基于Capsule-BiGRU的文本相似度分析算法[J]. 计算机工程与应用, 2021, 57(15): 171-177.

[1]	XU Hao, ZHANG Kai, TIAN Yingjie, CHONG Faguang, WANG Zichao. Review of Deep Neural Network-Based Image Caption [J]. Computer Engineering and Applications, 2021, 57(9): 9-22.
[2]	ZHANG Zhentong, SHAN Yugang, YUAN Jie. Remote Sensing Image Detection Algorithm Combining Multi-scale and Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 212-216.
[3]	ZHAO Yuanli, LIANG Zhijian. Research on Stance Detection Based on Dual Attention Mechanism of Heteronuclear Convolution [J]. Computer Engineering and Applications, 2021, 57(8): 119-125.
[4]	ZHANG Yue, HUANG Yourui, LIU Pengkun. Research on Multi-resolution Human Pose Estimation with Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 126-132.
[5]	WANG Ling, WANG Jiapei, WANG Peng, SUN Shuangzi. Siamese Network Tracking Algorithms for Hierarchical Fusion of Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 169-174.
[6]	YANG Bo, TAO Qingchuan, DONG Peijun. Surgical Instrument Segmentation Method Based on Improved Deeplab v3+ Network [J]. Computer Engineering and Applications, 2021, 57(7): 222-227.
[7]	CHEN Wei, XU Yun. Research on Extraction of Biomedical Entity Relation Based on Literature Mining [J]. Computer Engineering and Applications, 2021, 57(7): 115-120.
[8]	LI Hui, ZHANG Tianyuan, JIN Shuyu. Social Emotion Mining in Ancient Chinese Metrical Poetry [J]. Computer Engineering and Applications, 2021, 57(7): 171-177.
[9]	HUANG Jinjie, LIN Jiangquan, HE Yongjun, HE Jinjie, WANG Yajun. Chinese Short Text Classification Algorithm Based on Local Semantics and Context [J]. Computer Engineering and Applications, 2021, 57(6): 94-100.
[10]	LIU Bowen, FAN Chunxiao. Relation Extraction Based on CapsuleNet via Position Perception [J]. Computer Engineering and Applications, 2021, 57(6): 101-107.
[11]	ZHANG Rui, WU Boxiong, ZHANG Liyuan, ZHANG Bo. Human Trajectory Prediction Method for Complex Scenes [J]. Computer Engineering and Applications, 2021, 57(6): 138-143.
[12]	WEI Wei, YANG Ru, ZHU Ye. Target Detection of Improved CenterNet to Remote Sensing Images [J]. Computer Engineering and Applications, 2021, 57(6): 191-199.
[13]	XU Jianguo, LIU Yonghui, LIU Mengfan. Research on Semantic Role Labeling of University Policy Based on BILSTM-CRF [J]. Computer Engineering and Applications, 2021, 57(6): 207-211.
[14]	ZHANG Qianyu, YAN Dongmei, HAN Jiatong. Research on Stock Price Prediction Combined with Deep Learning and Decomposition Algorithm [J]. Computer Engineering and Applications, 2021, 57(5): 56-64.
[15]	WANG Tiangang, ZHANG Xiaobin, MA Hongye, CAI Hongwei. Early Warning of Critical Illness Based on Explicable Hierarchical Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(5): 131-138.

Algorithm of Text Similarity Analysis Based on Capsule-BiGRU

基于Capsule-BiGRU的文本相似度分析算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics