计算机工程与应用 ›› 2022, Vol. 58 ›› Issue (12): 209-217.DOI: 10.3778/j.issn.1002-8331.2011-0018

• 图形图像处理 • 上一篇    下一篇

基于SSD的轻量级车辆检测网络改进

徐浩,杨德刚,蒋倩倩,何林晋   

  1. 重庆师范大学 计算机与信息科学学院,重庆 401331
  • 出版日期:2022-06-15 发布日期:2022-06-15

Improvement of Lightweight Vehicle Detection Network Based on SSD

XU Hao, YANG Degang, JIANG Qianqian, HE Linjin   

  1. School of Computer and Information Science, Chongqing Normal University, Chongqing 401331, China
  • Online:2022-06-15 Published:2022-06-15

摘要: 针对嵌入式摄像设备在执行目标检测任务过程中,对于移动中车辆的检测耗时较长无法及时反馈检测结果的问题,提出了一种基于残差连接和注意力机制的轻量级卷积网络来对SSD(single shot multibox detector)目标检测模型进行改进。采用h-swish和h-sigmoid激活函数分别替换残差块中的ReLU激活函数和通道注意力模块中的sigmoid激活函数,降低训练和推理所需计算量。根据实际应用中特定角度下车辆外形的特征为依据,重新设计SSD目标检测方法的默认框生成比例,并结合输入图像大小及特征图感受野来减少特征融合层及默认框匹配运算量。实验表明改进后的SSD检测模型在BIT-Vehicle Dataset上的平均精度均值(mean average precision,mAP)达到了94.87%,相较于经典SSD目标检测模型的mAP提升了0.83个百分点,在搭载了Intel NCS2的Raspbery PI 3+上平均处理速度达到了16?frame/s。

关键词: 车辆检测, 机器视觉, SSD模型, 深度学习, 树莓派

Abstract: Aiming at the problem that the detection of moving vehicles takes a long time and the detection results cannot be timely feedback during the execution of target detection task by embedded camera equipment, a lightweight convolutional network based on residual connection and attention mechanism is proposed to improve the single shot multibox detector(SSD) target detection model. Firstly, h-swish and h-sigmoid activation functions are used to replace ReLU activation function in the residual block and sigmoid activation function in the channel attention module respectively, to reduce the computation required for training and reasoning. Secondly, based on the shape characteristics of vehicles at a specific angle in practical application, the default box generation ratio of SSD target detection method is redesigned. What’s more, the computation of feature fusion layer and default box matching is reduced by combining the input image size and the effective receptive field of feature map. Experimental results show that the mean average precision(mAP) of the improved SSD detection model on the BIT-Vehicle Dataset reaches 94.87%, which is 0.83 percentage points higher than that of classic SSD target detection model mAP, and the average processing speed reaches 16 frames per second on the Raspberry PI 3+ equipped with Intel NCS2.

Key words: vehicle detection, machine vision, single shot multibox detector model(SSD), deep learning, Raspbery PI