Parameter Compression of Recurrent Neural Networks Based on Time-Error

doi:10.3778/j.issn.1002-8331.1810-0376

Abstract

Abstract: Recurrent neural networks are widely used in various sequence data processing tasks, such as machine translation, speech recognition, image annotation and so on. The language model based on recurrent neural networks usually contains a large number of parameters, which limits the use of the model on mobile devices or embedded devices to some extent. Aiming at this problem, a low rank reconstruction compression method based on time-error is proposed, which adds the time-error reconstruction function on the basis of low rank reconstruction compression, and the input activation mechanism of long short-term memory network is adopted. Numerical experiments on multiple data sets show that the proposed method has a better effect on compression.

Key words: recurrent neural networks, long short-term memory, low rank reconstruction compression, low rank reconstruction compression based on time-error

摘要： 循环神经网络被广泛应用于各种序列数据处理任务中，如机器翻译、语音识别、图像标注等。基于循环神经网络的语言模型通常包含大量的参数，这一点在一定程度上限制了模型在移动设备或嵌入式设备上的使用。在低秩重构压缩的基础上，增加时间误差重构函数，并采用长短时记忆网络中的输入激活机制，提出了一种基于时间误差的低秩重构压缩方法。多个数据集上的数值实验表明，该方法具有较好的压缩效果。

关键词: 循环神经网络, 长短时记忆网络, 低秩重构压缩, 基于时间误差的低秩重构压缩

WANG Longgang, LIU Shijie, FENG Shanshan, LI Hongwei. Parameter Compression of Recurrent Neural Networks Based on Time-Error[J]. Computer Engineering and Applications, 2020, 56(3): 134-138.

王龙钢，刘世杰，冯珊珊，李宏伟. 基于时间误差的循环神经网络参数压缩[J]. 计算机工程与应用, 2020, 56(3): 134-138.

[1]	YANG Chunxia, LI Xinxu, WU Jiajun, LIU Tianyu. Hierarchical Network Sentiment Classification Based on Attention Interaction Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 134-139.
[2]	LI Zhenxiao, SUN Wei, LIU Mingming, ZHENG Lili, CHEN Shaoying. Research on Vehicle Detection and Tracking Algorithms in Traffic Monitoring Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 103-111.
[3]	YANG Qian, GU Lei. Chinese Named Entity Recognition Based on Denoising Joint Character-Word Model [J]. Computer Engineering and Applications, 2021, 57(7): 151-157.
[4]	HUANG Jinjie, LIN Jiangquan, HE Yongjun, HE Jinjie, WANG Yajun. Chinese Short Text Classification Algorithm Based on Local Semantics and Context [J]. Computer Engineering and Applications, 2021, 57(6): 94-100.
[5]	XU Jianguo, LIU Yonghui, LIU Mengfan. Research on Semantic Role Labeling of University Policy Based on BILSTM-CRF [J]. Computer Engineering and Applications, 2021, 57(6): 207-211.
[6]	YAO Honggang, MU Nianguo. Prediction of Financial Time Series by EMD-LSTM Model [J]. Computer Engineering and Applications, 2021, 57(5): 239-244.
[7]	XU Xianfeng, CAI Lulu, ZHANG Li. Photovoltaic Power Generation Prediction Algorithm Based on MLP and DBN [J]. Computer Engineering and Applications, 2021, 57(3): 266-272.
[8]	TENG Jinbao, KONG Weiwei, TIAN Qiaoxin, WANG Zhaoqian, LI Long. Multi-channel Attention Mechanism Text Classification Model Based on CNN and LSTM [J]. Computer Engineering and Applications, 2021, 57(23): 154-162.
[9]	YI Lingzhi, WANG Shitong, YI Fang, DENG Dong, YI Zhimin, JIANG Peng. Wind Farm Ultra-Short-Term Wind Speed Prediction Based on EEMDSE-ILSTM [J]. Computer Engineering and Applications, 2021, 57(22): 288-294.
[10]	WU Minghui, HOU Lingyan, WANG Chao. Improved Mechanism of Prediction-Oriented Long Short-Term Memory Neural Network [J]. Computer Engineering and Applications, 2021, 57(21): 109-115.
[11]	JIANG Kui, QIU Yuandong, ZHENG Haocheng. ICMPv6 DDoS Attack Detection Method Based on Information Entropy and LSTM [J]. Computer Engineering and Applications, 2021, 57(21): 148-154.
[12]	GENG Lixiao, LIU Lisha, LI Hengyu. Research on Stock Index Prediction Driven by Multi-source Heterogeneous Data Fusion [J]. Computer Engineering and Applications, 2021, 57(20): 142-149.
[13]	CAO Lei, LI Zhanbin, YANG Yongsheng, ZHAO Longfei. Intrusion Detection Method Based on Two-Layer Attention Networks [J]. Computer Engineering and Applications, 2021, 57(19): 142-149.
[14]	LI Wenliang, YANG Qiuxiang, QIN Quan. Multi-feature Mixed Model Text Sentiment Analysis Method [J]. Computer Engineering and Applications, 2021, 57(19): 205-213.
[15]	DING Yuyang, LI Mingyue, XIE Ningyu, LIU Yuan, YAN Tao. Research of Dual LSTM Method for Rain Streaks Removal on Light Field Images [J]. Computer Engineering and Applications, 2021, 57(18): 227-237.

Parameter Compression of Recurrent Neural Networks Based on Time-Error

基于时间误差的循环神经网络参数压缩

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics