Deep Neural Network Based on Attention Convolution Module for Image Recognition

doi:10.3778/j.issn.1002-8331.1812-0047

Abstract

Abstract: A current challenge in deep neural network research is to fuse information deeply across the branches of a network's middle layers, producing a base network that shares useful information, optimizes information flow, and improves performance. This paper proposes a deep neural network for image recognition based on an attention convolution module. The module has two parts: a trunk branch and a soft branch. The trunk branch consists of two sets of residual modules, which makes the module applicable to other deep neural networks. The soft branch derives an attention feature map from the given intermediate feature map along two dimensions (spatial and channel) and uses it to adjust the input intermediate feature map, strengthening useful information and suppressing useless information. The improved convolutional residual module resolves mismatches between input and output sizes, strengthens the key information in the image, and effectively promotes information flow through the network. Experiments on the CIFAR-10, CIFAR-100, CK+, and AVEC2017 datasets show that, applied to ResNet-50, the proposed method improves recognition accuracy by 0.9% to 1.2% over the method proposed by Hu, with a difference in training time of less than 0.3%.
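
The soft branch described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give the layer details, so the bottleneck weights `w1` and `w2`, the reduction ratio `r`, the use of a channel-wise mean in place of a learned spatial convolution, and the order of the two attention steps are all assumptions made for illustration.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def channel_attention(x, w1, w2):
    """Per-channel weights in (0, 1) for a feature map x of shape (C, H, W)."""
    pooled = x.mean(axis=(1, 2))            # (C,) global average pooling
    hidden = np.maximum(w1 @ pooled, 0.0)   # (C // r,) bottleneck with ReLU
    return sigmoid(w2 @ hidden)             # (C,)


def spatial_attention(x):
    """Per-location weights in (0, 1), shape (H, W)."""
    # Assumption: a channel-wise mean stands in for a learned convolution.
    return sigmoid(x.mean(axis=0))


def soft_branch(x, w1, w2):
    """Rescale x along the channel axis, then along the spatial axes."""
    x = x * channel_attention(x, w1, w2)[:, None, None]
    return x * spatial_attention(x)[None, :, :]


rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2                      # hypothetical sizes
x = rng.standard_normal((C, H, W))           # stand-in intermediate feature map
w1 = rng.standard_normal((C // r, C)) * 0.1  # random stand-ins for learned weights
w2 = rng.standard_normal((C, C // r)) * 0.1
refined = soft_branch(x, w1, w2)
```

Because both attention maps lie in (0, 1), every entry of `refined` is a damped copy of the corresponding entry of `x`; in a trained network the weights would decide which channels and locations are damped least. The trunk branch (two sets of residual modules) is omitted here, since the abstract does not specify how its output is combined with the soft branch.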

Key words: image recognition, residual module, attention, deep neural network


YUAN Jiajie, ZHANG Ling, CHEN Yunhua. Deep Neural Network Based on Attention Convolution Module for Image Recognition[J]. Computer Engineering and Applications, 2019, 55(8): 9-16.

袁嘉杰，张灵，陈云华. 基于注意力卷积模块的深度神经网络图像识别[J]. 计算机工程与应用, 2019, 55(8): 9-16.

