Efficient Multi-Object Detection Method Based on Improved SSD

doi:10.3778/j.issn.1002-8331.1811-0157

Abstract

To address the poor detection accuracy of one-stage object detection algorithms, this paper proposes FSD, an efficient multi-object localization and detection algorithm based on SSD. FSD improves on one-stage detection in two respects. First, it introduces a more efficient dense residual network, R-DenseNet, whose narrower dense structure preserves feature-extraction capacity while reducing computational complexity, which improves both the detection and the convergence performance of the algorithm. Second, it modifies the loss function: suppressing the weight of easily classified samples makes the algorithm more robust and alleviates the sample imbalance found in object detection. The network is deployed with the TensorFlow deep learning framework, and experiments are run on Ubuntu with an Nvidia Titan X. The experiments show that FSD achieves the highest detection accuracy on both the COCO and PASCAL VOC object detection datasets; FSD300 improves detection accuracy by 3.7% over SSD300, and its detection-phase speed is 10.87% higher than that of SSD.

Key words: deep learning, object detection, feature fusion, sample imbalance, convolutional neural network (CNN)
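
The abstract does not give the exact form of the modified loss function. As a minimal sketch of the general idea of suppressing the weight of easily classified samples, the snippet below uses a focal-loss-style modulating factor, which is one well-known way to do this and is not necessarily the formulation used in FSD. The function names and the focusing parameter `gamma` are hypothetical and chosen only for illustration.

```python
import math


def cross_entropy(p_true: float) -> float:
    """Standard cross-entropy given the probability assigned to the true class."""
    return -math.log(p_true)


def down_weighted_loss(p_true: float, gamma: float = 2.0) -> float:
    """Focal-loss-style term (an assumption, not the paper's stated loss).

    The factor (1 - p_true) ** gamma shrinks the loss of samples the model
    already classifies confidently. gamma is a hypothetical focusing
    parameter; gamma = 0 recovers plain cross-entropy.
    """
    return (1.0 - p_true) ** gamma * cross_entropy(p_true)


if __name__ == "__main__":
    # Compare an easy sample (p = 0.95) with harder ones (p = 0.6, p = 0.1).
    for p in (0.95, 0.6, 0.1):
        ce = cross_entropy(p)
        dw = down_weighted_loss(p)
        print(f"p={p:.2f}  CE={ce:.4f}  down-weighted={dw:.4f}  ratio={dw / ce:.4f}")
```

With a factor of this kind, an easy sample contributes only a small fraction of its cross-entropy loss, so the many easy background samples in a one-stage detector dominate training less than the few hard ones.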

WANG Wenguang, LI Qiang, LIN Maosong, HE Xianzhen. Efficient Multi-Object Efficient Object Detection Method Based on Improved SSD[J]. Computer Engineering and Applications, 2019, 55(13): 28-35.
