Computer Engineering and Applications ›› 2020, Vol. 56 ›› Issue (1): 185-190. DOI: 10.3778/j.issn.1002-8331.1907-0101
DANG Hongshe, TAO Yafan, ZHANG Xuande
Abstract: Sequence-to-sequence models based on recurrent neural networks have achieved good performance on abstractive summarization, but most of them suffer from repetitive generation and exposure bias. To address the repetition problem, a mixed attention mechanism composed of temporal attention and decoder self-attention is proposed, which overcomes repetition by storing the history of attention weights and adding attention over previously generated words. Reinforcement learning is adopted as a new training method to resolve exposure bias, and the loss function is modified to further improve the results. The model is evaluated on the CNN/Daily Mail dataset with ROUGE as the metric; the results show that the mixed attention considerably alleviates the repetition problem, that reinforcement learning eliminates exposure bias, and that the integrated model surpasses state-of-the-art algorithms on the test set.
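To make the mixed attention concrete, the following is a minimal PyTorch sketch of one common formulation of its two components: a temporal attention that stores the history of attention weights and down-weights encoder positions already attended to, and a decoder self-attention over previously generated words. The function names, the dot-product scoring, and the tensor shapes are illustrative assumptions; the abstract does not specify the exact scoring functions used in the paper.

```python
import torch
import torch.nn.functional as F

def temporal_attention(scores_t, past_exp_scores):
    """Temporal attention over the encoder states.

    scores_t:        (batch, src_len) raw attention scores at decoding step t
    past_exp_scores: (batch, src_len) running sum of exp(scores) from steps < t,
                     or None at the first step
    Dividing by the stored history penalizes source positions that already
    received high attention, which discourages repeated phrases.
    """
    exp_scores = torch.exp(scores_t)
    if past_exp_scores is None:            # first step: plain softmax
        attn = F.softmax(scores_t, dim=-1)
        past_exp_scores = exp_scores
    else:
        temporal = exp_scores / past_exp_scores
        attn = temporal / temporal.sum(dim=-1, keepdim=True)
        past_exp_scores = past_exp_scores + exp_scores
    return attn, past_exp_scores

def decoder_self_attention(h_t, prev_states):
    """Attention of the current decoder state over previously decoded states.

    h_t:         (batch, hidden) decoder state at step t
    prev_states: (batch, t-1, hidden) earlier decoder states, or None
    Attending to what has already been generated gives the decoder an
    explicit signal to avoid emitting the same words again.
    """
    if prev_states is None:                # nothing decoded yet
        return torch.zeros_like(h_t)
    scores = torch.bmm(prev_states, h_t.unsqueeze(-1)).squeeze(-1)  # (batch, t-1)
    alpha = F.softmax(scores, dim=-1)
    return torch.bmm(alpha.unsqueeze(1), prev_states).squeeze(1)    # (batch, hidden)
```

In formulations of this kind, the two context vectors are typically concatenated with the decoder hidden state before the vocabulary projection.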
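The abstract states that reinforcement learning replaces the usual teacher-forcing regime and that the loss function is modified, but gives no formula. A common realization, sketched below under that assumption, is a self-critical policy gradient with ROUGE as the reward, mixed with the maximum-likelihood loss; the mixing weight gamma and the greedy-decoding baseline are assumptions for illustration, not details taken from the paper. All arguments are torch tensors.

```python
def mixed_rl_ml_loss(log_probs_sampled, reward_sampled, reward_greedy,
                     ml_loss, gamma=0.9):
    """Mixed training objective: gamma * L_rl + (1 - gamma) * L_ml.

    log_probs_sampled: (batch,) summed log-probabilities of a sampled summary
    reward_sampled:    (batch,) ROUGE of the sampled summary
    reward_greedy:     (batch,) ROUGE of the greedily decoded baseline summary
    ml_loss:           scalar maximum-likelihood (teacher-forcing) loss
    Because the model is rewarded for its own sampled outputs rather than
    only for continuing gold prefixes, this objective removes exposure bias.
    """
    advantage = reward_sampled - reward_greedy      # self-critical baseline
    rl_loss = -(advantage.detach() * log_probs_sampled).mean()
    return gamma * rl_loss + (1.0 - gamma) * ml_loss
```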
Key words: abstractive summarization, mixture attention, reinforcement learning, natural language processing, exposure bias, recurrent neural network
DANG Hongshe, TAO Yafan, ZHANG Xuande. Abstractive Summarization Model Based on Mixture Attention and Reinforcement Learning[J]. Computer Engineering and Applications, 2020, 56(1): 185-190.
URL: http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.1907-0101
http://cea.ceaj.org/EN/Y2020/V56/I1/185