YOLO-DNF：Vehicle and Pedestrian Detection Model for Assisted Vehicle Drivingly

doi:10.3778/j.issn.1002-8331.2204-0505

Abstract

Abstract: YOLO series object detection algorithms exist unbalanced accuracy and computational cost, as well as insufficient model generalization. To address these issues, this paper proposes a high accuracy and fast vehicle and pedestrian detection model based on YOLO, called YOLO-Day Night and Fast（YOLO-DNF）, which detects vehicle and pedestrian under different lighting scenarios. This paper analyses the effects of convolutional structure and network depth on the feature extraction capability and calculation cost of the backbone in relation to the convolutional neural networks used in the mainstream detection models today. Then this paper proposes an ACNet network with low computational cost and high feature extraction capability by selecting the convolutional structure arrow-block and CSP-Block for different levels of the network and determining the depth of the network by quantifying the computational cost of stacked units. In addition, the article analyses the difference in luminance between daytime and nighttime images and introduces a data enhancement strategy of HSV domain perturbation and luminance processing to enhance the detection accuracy of the model and improve the problem of insufficient generalization of the model. The experimental results show that the YOLO-DNF model achieves 32.8% detection accuracy of full time mAP at 24.36 frames per second after training in the training set of SODA10M dataset containing only daytime images, which exceeds the current mainstream detection models in terms of detection accuracy and speed. The nighttime accuracy reaches 27.7%, which improves the nighttime detection capability of the model and extends the detection application scenarios of the model.

Key words: object detection, convolutional neural network, data augmentation, YOLO

摘要： 为了解决YOLO系列目标检测算法存在的精度与计算成本不均衡、模型泛化性不足的问题，提出了可满足不同光照场景下目标检测需求的高精度快速的车辆与行人检测模型YOLO-Day Night and Fast（YOLO-DNF）。文中结合当下主流检测模型所使用的卷积神经网络分析卷积结构与网络深度对于主干网络特征提取能力和计算成本的影响，针对网络不同层次选取卷积结构Arrow-Block与CSP-Block搭建网络并通过量化堆叠单元的计算成本确定网络深度，提出低计算成本、高特征提取能力的ACNet网络。此外分析了白天与夜间图像的亮度差异，引入了HSV域扰动并提出亮度处理的数据增强策略，提升了模型的夜间检测精度，改善了模型泛化性不足的问题。实验结果表明：YOLO-DNF模型在SODA10M数据集仅含白天图像的训练集中训练后以每秒24.36帧的检测速率达到32.8%的全时段mAP检测精度，检测精度与速度超过目前主流检测模型。其中夜间精度达到了27.7%，扩展了模型的检测应用场景。

关键词: 目标检测, 卷积神经网络, 数据增强, YOLO

ZHANG Xiuyi, CHEN Changxing, DU Juan, LI Jia, CHENG Kuanhong. YOLO-DNF：Vehicle and Pedestrian Detection Model for Assisted Vehicle Drivingly[J]. Computer Engineering and Applications, 2022, 58(24): 212-222.

张修懿, 陈长兴, 杜娟, 李佳, 成宽洪. 辅助驾驶的车辆与行人检测模型YOLO-DNF[J]. 计算机工程与应用, 2022, 58(24): 212-222.

References

[1] 程婧怡，段先华，朱伟.改进YOLOv3的金属表面缺陷检测研究[J].计算机工程与应用，2021，57（19）：252-258.
CHENG J Y，DUAN X H，ZHU W.Research on metal surface defect detection by improved YOLOv3[J].Computer Engineering and Applications，2021，56（19）：252-258.
[2] CHEN K B，XUAN Y，LIN A J，et al.Esophageal cancer detection based on classification of gastrointestinal CT images using improved faster RCNN[J].Computer Methods and Programs in Biomedicine，2021，207：106172.
[3] DAI X，HU J，ZHANG H，et al.Multi-task faster R-CNN for nighttime pedestrian detection and distance estimation[J].Infrared Physics & Technology，2021，115（4）：103694.
[4] DALAL N.Histograms of oriented gradients for human detection[C]//Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition，2005.
[5] BAY H，TUYTELAARS T，GOOL L V.SURF：speeded up robust features[C]//Proceedings of the 9th European Conference on Computer Vision-Volume Part I.[S.l.]：Springer-Verlag，2006.
[6] HEARST M A，DUMAIS S T，OSMAN E，et al.Support vector machines[J].IEEE Intelligent Systems & Their Applications，1998，13（4）：18-28.
[7] FELZENSZWALB P F.Object detection with discriminatively trained part-based models[J].IEEE Transactions on Pattern Analysis & Machine Intelligence，2010，32（9）：1627-1645.
[8] GIRSHICK R，DONAHUE J，DARRELL T，et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Computer Vision and Pattern Recognition，2014：580-587.
[9] GIRSHICK R.Fast R-CNN[C]//IEEE International Conference on Computer Vision，2015：1440-1448.
[10] REN S，HE K，GIRSHICK R，et al.Faster R-CNN：towards real-time object detection with region proposal networks[C]//International Conference on Neural Information Processing Systems，2015：91-99.
[11] REDMON J，DIVVALA S，GIRSHICK R，et al.You only look once：unified，real-time object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition（CVPR）.New York：IEEE，2016：779-788.
[12] BOCHKOVSKIY A，WANG C Y，LIAO H Y.Yolov4：optimal speed and accuracy of object detection[EB/OL].（2020-04-23）[2021-08-24].https：//arxiv.org/pdf/2004.10934.pdf.
[13] WANG C Y，BOCHKOVSKIY A，LIAO H.Scaled-YOLOv4：scaling cross stage partial network [EB/OL].（2020-11-16）[2021-03-31].https：//arxiv.org/pdf/2011.08036.pdf.
[14] CHENG Y，HU K，WU J，et al.A convolutional neural network based degradation indicator construction and health prognosis using bidirectional long short-term memory network for rolling bearings[J].Advanced Engineering Informatics，2021，48（1）：101247.
[15] LATINOVIC N，VUKOVIC T，PETROVIC R，et al.Implementation challenge and analysis of thermal image degradation on R-CNN face detection[J].Telfor Journal，2020，12（2）：98-103.
[16] GEIHER A，LENZ P，STILLER C，et al.Vision meets robotics：the KITTI dataset[J].International Journal of Robotics Research，2013，32（11）：1231-1237.
[17] HAN J，LIANG X.SODA10M：towards large scale object detection benchmark for autonomous driving[EB/OL].（2021-11-08）[2022-02-17].https：//arxiv.org/pdf/2106.11118.pdf.
[18] YAO Y，ZHENG L，YANG X，et al.Simulating content consistent vehicle datasets with attribute descent[M].Cham：Springer，2020.
[19] 齐春阳.基于单目视觉夜间前方车辆检测与距离研究[D].长春：吉林大学.
QI C Y.Research on vehicle detection and distance at night based on monocular vision[D].Changchun：Jilin University，2021.
[20] 涂媛雅，汤国放，张建勋.Lite-YOLOv3轻量级行人与车辆检测网络[J/OL].小型微型计算机系统，（2021-10-18）[2022-06-10].http：//kns.cnki.net/kcms/detail/21.1106.tp.
20211015.0001.002.html.
TU Y Y，TANG G F，ZHANG J X.Lite-YOLOv3 lightweight pedestrian and vehicle detection network[J/OL].Journal of Chinese Computer Systems，（2021-10-18）[2022-06-10].https：//kns.cnki.net/kcms/detail/21.1106.tp.20211015.
0001.002.html.
[21] 孟利霞.基于深度学习的车辆行人检测方法研究[D].太原：中北大学，2021.
MENG L X.Research on vehicle pedestrian detection method based on deep learning[D].Taiyuan：North University of China，2021.