Design of Hardware Accelerator for Embedded Convolutional Neural Network

doi:10.3778/j.issn.1002-8331.1912-0099

Abstract

Abstract:

In recent years, neural network models become more and more complex. Aiming at the large memory space required for convolutional neural network inference calculations, which limits its deployment on embedded devices, a dynamic multi-precision fixed-point data quantization hardware structure is proposed. It uses fixed-point data instead of floating-point data during neural network inference to perform convolutional operations. The results show that compared with the static quantization strategy, using a 16 bit fixed-point dynamic quantization and parallel convolutional operation hardware architecture, data accuracy is up to 97.96%. The hardware unit area is only 13740 gates, and the memory footprint and bandwidth requirement are reduced 50%. In addition, compared with Cortex M4, which performs convolutional operations using floating-point data, the embedded system SoC performance is improved more than 90%.

Key words: convolutional neural network, embedded devices, dynamic multi-precision fixed-point data quantization, parallel convolutional operation hardware architecture

摘要：

近年来，随着神经网络模型越来越复杂，针对卷积神经网络推理计算所需内存空间过大，限制其在嵌入式设备上部署的问题，提出一种动态多精度定点数据量化硬件结构，使用定点数代替训练后推理过程中的浮点数执行卷积运算。结果表明，采用16位动态定点量化和并行卷积运算硬件架构，与静态量化策略相比，数据准确率高达97.96%，硬件单元的面积仅为13 740门，且内存占用量和带宽需求减半。相比Cortex M4使用浮点数据做卷积运算，该硬件加速单元性能提升了90%以上。

关键词: 卷积神经网络, 嵌入式设备, 动态多精度定点数据量化, 并行卷积运算硬件架构

TANG Rui, JIAO Jiye, XU Huahao. Design of Hardware Accelerator for Embedded Convolutional Neural Network[J]. Computer Engineering and Applications, 2021, 57(4): 252-257.

唐蕊，焦继业，徐华昊. 面向嵌入式的卷积神经网络硬件加速器设计[J]. 计算机工程与应用, 2021, 57(4): 252-257.

[1]	MOU Qingping, ZHANG Ying, ZHANG Dongbo, WANG Xinjie, YANG Zhiqiao. Research on Visual Tracking Algorithm and Application of Target Loss Discrimination Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 140-147.
[2]	BAO Zhiqiang, XING Yu, LYU Shaoqing, HUANG Qiongdan. Improved YOLO V2 6D Object Pose Estimation Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 148-153.
[3]	HUANG Dongyi, YANG Bing, WU Zihao, KUANG Jiayi, YAN Zeming. Spatio-Temporal Fully Connected Convolutional Neural Networks for Citywide Cellular Prediction [J]. Computer Engineering and Applications, 2021, 57(9): 168-175.
[4]	ZHAO Zhiyan, YANG Hua, HU Zhiwei, YU Haiping. Identification Model of Pests on Yuluxiang Pear Leaves Based on TACNN [J]. Computer Engineering and Applications, 2021, 57(9): 176-181.
[5]	ZHOU Lungang, SUN Yifeng, WANG Kun, WU Jiang, HUANG Weigui, LI Binglong. End to End Object Recognition Algorithm for Multi-attributes of Multi-values [J]. Computer Engineering and Applications, 2021, 57(9): 182-190.
[6]	ZHANG Cheng, DAI Junfeng, XIONG Wenxin. Improved Handwritten Date Recognition in Scanned Documents Combined with LeNet-5 [J]. Computer Engineering and Applications, 2021, 57(9): 207-211.
[7]	MA Zhexu, YANG Feng, QIAO Xu. Intelligent Detection Method of Railway Subgrade Defect [J]. Computer Engineering and Applications, 2021, 57(9): 272-278.
[8]	RAN Rong, XU Xinghua, QIU Shaohua, CUI Xiaopeng, OUYANG Bin. Review of Crack Detection Methods Based on Deep Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(9): 23-35.
[9]	ZHANG Yue, HUANG Yourui, LIU Pengkun. Research on Multi-resolution Human Pose Estimation with Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 126-132.
[10]	LIANG Fangxuan, YANG Feng, LU Liyun, YIN Mengxiao. Review of Brain Tumor Segmentation Methods Based on Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(7): 34-43.
[11]	YANG Peiwei, ZHOU Yuhong, XING Gang, TIAN Zhiqiang, XU Xiayu. Applications of Convolutional Neural Network in Biomedical Image [J]. Computer Engineering and Applications, 2021, 57(7): 44-58.
[12]	CHANG Hao, CHEN Xiaolei, ZHANG Aihua, LI Ce, LIN Dongmei. Continuous Blood Pressure Prediction Based on Improved SENet Convolutional Neural Network [J]. Computer Engineering and Applications, 2021, 57(7): 130-135.
[13]	WANG Chong, HAN Zhenqi, XU Haoyu, ZHU Yongxin, XU Sheng, CHEN Xia. Efficient Crack Detection Algorithm Based on Improved Saliency Map [J]. Computer Engineering and Applications, 2021, 57(6): 219-224.
[14]	HUANG Jinjie, LIN Jiangquan, HE Yongjun, HE Jinjie, WANG Yajun. Chinese Short Text Classification Algorithm Based on Local Semantics and Context [J]. Computer Engineering and Applications, 2021, 57(6): 94-100.
[15]	ZHANG Liang, ZHANG Zeng, SHU Weihua, MEI Kuizhi. Convolutional Layered Pruning Based on YOLOv3 [J]. Computer Engineering and Applications, 2021, 57(6): 131-137.

Design of Hardware Accelerator for Embedded Convolutional Neural Network

面向嵌入式的卷积神经网络硬件加速器设计

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics