Small Object Detection Algorithm Based on Improved YOLOv5 in UAV Image
XIE Chunhui, WU Jinming, XU Huaiyu
1.Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China
2.School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
XIE Chunhui, WU Jinming, XU Huaiyu. Small Object Detection Algorithm Based on Improved YOLOv5 in UAV Image[J]. Computer Engineering and Applications, 2023, 59(9): 198-206.
[1] HE J,ERFANI S,MA X,et al.Alpha-IoU:a family of power intersection over union losses for bounding box regression[C]//Advances in Neural Information Processing Systems,2021:20230-20242.
[2] KISANTAL M,WOJNA Z,MURAWSKI J,et al.Augmentation for small object detection[J].arXiv:1902.07296,2019.
[3] CHEN C,ZHANG Y,LV Q,et al.RRNet:a hybrid detector for object detection in drone?captured images[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops.Los Alamitos:IEEE,2019:100?108.
[4] YU X,GONG Y,JIANG N,et al.Scale match for tiny person detection[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.Los Alamitos:IEEE,2020:1257?1265.
[5] CHEN Y,ZHANG P,LI Z,et al.Stitcher:feedback-driven data provider for object detection[J].arXiv:2004.12432,2020.
[6] LIN T Y,DOLLáR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2017:2117-2125.
[7] DENG C,WANG M,LIU L,et al.Extended feature pyramid network for small object detection[J].IEEE Transactions on Multimedia,2021,24:1968-1979.
[8] 李青援,邓赵红,罗晓清,等.注意力与跨尺度融合的SSD目标检测算法[J].计算机科学与探索,2022,16(11):2575-2586.
LI Q Y,DENG Z H,LUO X Q,et al.SSD object detection algorithm with attention and cross-scale fusion[J].Journal of Frontiers of Computer Science and Technology,2022,16(11):2575-2586.
[9] 梁延禹,李金宝.多尺度非局部注意力网络的小目标检测算法[J].计算机科学与探索,2020,14(10):1744-1753.
LIANG Y Y,LI J B.Small objects detection method based on multi-scale non-local attention network[J].Journal of Frontiers of Computer Science and Technology,2020,14(10):1744-1753.
[10] ZHU X,LYU S,WANG X,et al.TPH-YOLOv5:improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision,2021:2778-2788.
[11] YANG C,HUANG Z,WANG N.QueryDet:cascaded sparse query for accelerating high-resolution small object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2022:13668-13677.
[12] JADERBERG M,SIMONYAN K,ZISSERMAN A.Spatial transformer networks[C]//Advances in Neural Information Processing Systems,2015.
[13] DAI J,QI H,XIONG Y,et al.Deformable convolutional networks[C]//Proceedings of the IEEE International Conference on Computer Vision,2017:764-773.
[14] HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2018:7132-7141.
[15] WANG Q,WU B,ZHU P,et al.ECA-Net:efficient channel attention for deep convolutional neural networks[J].arXiv:1910.03151,2019.
[16] HOU Q,ZHOU D,FENG J.Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2021:13713-13722.
[17] WOO S,PARK J,LEE J Y,et al.Cbam:convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision(ECCV),2018:3-19.
[18] FU J,LIU J,TIAN H,et al.Dual attention network for scene segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2019:3146-3154.
[19] HUANG Z,WANG X,HUANG L,et al.Ccnet:criss-cross attention for semantic segmentation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision,2019:603-612.
[20] GE Z,LIU S,WANG F,et al.Yolox:exceeding yolo series in 2021[J].arXiv:2107.08430,2021.
[21] LI Z,PENG C,YU G,et al.Light-head R-CNN:in defense of two-stage object detector[J].arXiv:1711. 07264,2017.
[22] CAI Z,VASCONCELOS N.Cascade R-CNN:delving into high quality object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2018:6154-6162.
[23] LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision,2017:2980-2988.
[24] LAW H,DENG J.Cornernet:detecting objects as paired keypoints[C]//Proceedings of the European Conference on Computer Vision(ECCV),2018:734-750.
[25] MUHAMMAD M B,YEASIN M.Eigen-cam:class activation map using principal components[C]//2020 International Joint Conference on Neural Networks(IJCNN),2020:1-7.