计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (21): 105-116.DOI: 10.3778/j.issn.1002-8331.2504-0095

• YOLO改进及应用专题 • 上一篇    下一篇

改进YOLOv8n的无人机对地多尺度目标检测算法及实现

吴思,黄丹丹,刘智,王惠绩,田成军   

  1. 1.长春理工大学 电子信息工程学院,长春 130022
    2.长春理工大学 空间光电技术国家地方联合工程研究中心,长春 130022
  • 出版日期:2025-11-01 发布日期:2025-10-31

Improved UAV Multi-Scale Object Detection Algorithm Based on YOLOv8n for Ground Targets and Implementation

WU Si, HUANG Dandan, LIU Zhi, WANG Huiji, TIAN Chengjun   

  1. 1.School of Electronics and Information Engineering, Changchun University of Science and Technology, Changchun 130022, China
    2.National and Local Joint Engineering Research Center of Space Photoelectric Technology, Changchun University of Science and Technology, Changchun 130022, China
  • Online:2025-11-01 Published:2025-10-31

摘要: 针对无人机视角下,检测目标尺寸小、尺度跨度大、复杂场景导致的互相遮挡等检测难点,提出一种基于YOLOv8n的改进目标检测网络。引入空间深度卷积SPD-Conv并对其进行轻量化改进,得到SPDs-Conv,在提升小目标检测性能的同时有效降低模型的参数量;设计C2f_SKAttention模块进行多尺度信息的动态调整,实现更好的信息融合;采用一种全新的动态检测头策略,将各类目标的检测头进行了有机统一,降低由复杂场景产生的遮挡等情况对检测的影响。在VisDrone2019数据集上实验得出,改进后的模型在6.9×106参数下实现了49.7%的平均精度,较基准算法提高了15.1个百分点;并具有41?FPS的推理速度,能够实现实时检测,同时相较于其他算法,该算法能够达到更高的检测精度。改进后的模型在Jetson AGX Orin设备上也达到了49.3%的平均精度,推理速度达到28.7?FPS,验证了其效率和对实时嵌入式边缘平台目标检测任务的适用性。

关键词: 无人机, 目标检测, SPDs-Conv, C2f_SKAttention模块, 动态检测头

Abstract: In view of the challenges in target detection from an unmanned aerial vehicle (UAV) perspective, such as small target sizes, large scale variations, and mutual occlusion in complex scenarios, an improved target detection network based on YOLOv8n is proposed. the spatial depth convolution (SPD-Conv) is introduced and lightweight improvements are made to it, resulting in the SPDs-Conv. This not only enhances the performance of small target detection but also effectively reduces the number of model parameters. The C2f_SKAttention module is designed to dynamically adjust multi- scale information, achieving better information fusion. A novel dynamic detection head strategy is adopted. This strategy organically unifies the detection heads for various types of targets, reducing the impact of occlusion and other issues caused by complex scenarios on detection. Experiments on the VisDrone2019 dataset show that the improved model achieves an average precision of 49.7% with 6.9×106 parameters, which is 15.1?percentage points higher than the baseline algorithm. It also has an inference speed of 41?FPS, enabling real-time detection. Compared with other algorithms, the proposed algorithm can achieve higher detection accuracy. The improved model also achieves an average precision of 49.3% on the Jetson AGX Orin device, with an inference speed of 28.7 FPS, verifying its efficiency and applicability to real-time object detection tasks on embedded edge platforms.

Key words: unmanned aerial vehicle (UAV), object detection, SPDs-Conv, C2f_SKAttention module, dynamic detection head