Object Detection Based on Deep Learning and Attention Mechanism

doi:10.3778/j.issn.1002-8331.1902-0155

Computer Engineering and Applications ›› 2019, Vol. 55 ›› Issue (17): 180-184.DOI: 10.3778/j.issn.1002-8331.1902-0155

Previous Articles Next Articles

Object Detection Based on Deep Learning and Attention Mechanism

SUN Ping, HU Xudong, ZHANG Yongjun

1.School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
2.Geo-Science and Technology Service Network, CAS, Image Sky, Suzhou, Jiangsu 215000, China

Online:2019-09-01 Published:2019-08-30

结合注意力机制的深度学习图像目标检测

孙萍，胡旭东，张永军

1.武汉大学遥感信息工程学院，武汉 430079
2.中国科学院地理信息与文化科技产业基地中科天启，江苏苏州 215000

Abstract

Abstract: In the Convolution Neural Network（CNN）, convolutional layers are translation-invariant, which weaken the localization performance of object detector. Actually, objects usually have distinct sub-region spatial characteristics and aspect ratio characteristics, but in prevalent two-stage object detection methods, these translation-variant feature components are rarely considered. In order to optimize the feature representations, the sub-region attention bank and aspect ratio attention bank are introduced into the two-stage object detection framework and generate the corresponding attention maps to refine the original ROI features.In addition, with the aid of the attention maps, the feature dimension can be greatly reduced.The experimental results show that object detectors equipped with attention module improve the accuracy and inference speed signi cantly.

Key words: object detection, Convolution Neural Network（CNN）, attention mechanism, dimension reduction

摘要： 利用卷积神经网络进行目标检测时，提取的卷积特征具有很强的平移不变性，这将削弱模型的定位性能。事实上，目标对象通常具有不同的子区域特征和宽高比特性，但在目前流行的两阶段目标检测框架中，很少考虑这些具有平移尺度敏感性的特征成分。为了优化模型的特征表达，将在两阶段目标检测框架中引入与子区域特征和宽高比特性相关的注意力特征库，并生成注意力特征图对原始的ROI池化特征进行优化。另外，在注意力特征图的辅助下，模型特征维度可以有效地进行缩减。实验结果表明，引入注意力模块后，模型的检测精度和检测速度有明显提升。

关键词: 目标检测, 卷积神经网络（CNN）, 注意力机制, 特征降维

SUN Ping, HU Xudong, ZHANG Yongjun. Object Detection Based on Deep Learning and Attention Mechanism[J]. Computer Engineering and Applications, 2019, 55(17): 180-184.

孙萍，胡旭东，张永军. 结合注意力机制的深度学习图像目标检测[J]. 计算机工程与应用, 2019, 55(17): 180-184.

[1]	XU Hao, ZHANG Kai, TIAN Yingjie, CHONG Faguang, WANG Zichao. Review of Deep Neural Network-Based Image Caption [J]. Computer Engineering and Applications, 2021, 57(9): 9-22.
[2]	BAO Zhiqiang, XING Yu, LYU Shaoqing, HUANG Qiongdan. Improved YOLO V2 6D Object Pose Estimation Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 148-153.
[3]	ZHOU Lungang, SUN Yifeng, WANG Kun, WU Jiang, HUANG Weigui, LI Binglong. End to End Object Recognition Algorithm for Multi-attributes of Multi-values [J]. Computer Engineering and Applications, 2021, 57(9): 182-190.
[4]	ZHANG Zhentong, SHAN Yugang, YUAN Jie. Remote Sensing Image Detection Algorithm Combining Multi-scale and Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 212-216.
[5]	WANG Bo, SONG Dan, WANG Hongyu. Research on Key Technologies of UAV Autonomous Inspection System [J]. Computer Engineering and Applications, 2021, 57(9): 255-263.
[6]	XU Shaojie, CAO Chuqing, WANG Yongjuan. Application Research of Visual SLAM in Indoor Dynamic Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 175-179.
[7]	DONG Peng, ZHOU Feng, ZHAO Congcong, WANG Yafei, MI Zetian, FU Xianping. Automatic Measurement of Underwater Sea Cucumber Size Based on Binocular Vision [J]. Computer Engineering and Applications, 2021, 57(8): 271-278.
[8]	XU Degang, WANG Lu, LI Fan. Review of Typical Object Detection Algorithms for Deep Learning [J]. Computer Engineering and Applications, 2021, 57(8): 10-25.
[9]	LI Zhenxiao, SUN Wei, LIU Mingming, ZHENG Lili, CHEN Shaoying. Research on Vehicle Detection and Tracking Algorithms in Traffic Monitoring Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 103-111.
[10]	ZHAO Yuanli, LIANG Zhijian. Research on Stance Detection Based on Dual Attention Mechanism of Heteronuclear Convolution [J]. Computer Engineering and Applications, 2021, 57(8): 119-125.
[11]	ZHANG Yue, HUANG Yourui, LIU Pengkun. Research on Multi-resolution Human Pose Estimation with Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 126-132.
[12]	WANG Ling, WANG Jiapei, WANG Peng, SUN Shuangzi. Siamese Network Tracking Algorithms for Hierarchical Fusion of Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 169-174.
[13]	LI Xianguo, FENG Xinxin, LI Jianxiong. Sigle Image Super-Resolution Reconstruction Based on Multi-scale Residual Network [J]. Computer Engineering and Applications, 2021, 57(7): 215-221.
[14]	YANG Bo, TAO Qingchuan, DONG Peijun. Surgical Instrument Segmentation Method Based on Improved Deeplab v3+ Network [J]. Computer Engineering and Applications, 2021, 57(7): 222-227.
[15]	CHEN Wei, XU Yun. Research on Extraction of Biomedical Entity Relation Based on Literature Mining [J]. Computer Engineering and Applications, 2021, 57(7): 115-120.

Object Detection Based on Deep Learning and Attention Mechanism

结合注意力机制的深度学习图像目标检测

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics