计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (13): 100-112.DOI: 10.3778/j.issn.1002-8331.2411-0328

• 目标检测专题 • 上一篇    下一篇

SCE-YOLO:改进YOLOv8的轻量级无人机视觉检测算法

张帅,王波涛,涂嘉怡,陈聪实   

  1. 北京工业大学 信息科学技术学院,北京 100124
  • 出版日期:2025-07-01 发布日期:2025-06-30

SCE-YOLO: Improved Lightweight YOLOv8 Algorithm for UAV Visual Detection

ZHANG Shuai, WANG Botao, TU Jiayi, CHEN Congshi   

  1. School of Information Science and Technology, Beijing University of Technology, Beijing 100124, China
  • Online:2025-07-01 Published:2025-06-30

摘要: 针对无人机航拍场景下的目标检测模型计算复杂、检测效果不佳等问题,提出一种改进YOLOv8的轻量级无人机目标检测算法SCE-YOLO。使用STA_C2f替换骨干网络中的C2f模块,提高模型的特征提取能力;将采用渐进重参数化方法改进的AIFI模块作为空间金字塔池化层,实现高质量的尺度特征交互;提出一种多尺度特征聚合扩散网络UAV_CFDPN,根据航拍小目标的尺度特征优化网络结构,设计特征聚合模块FAM以及新的特征聚合与扩散路径,使得模型获得丰富的多尺度特征和上下文信息,提高目标检测的尺度适应性;设计一种高效共享卷积模块ES-Head,在保持定位和分类能力的同时,使得模型更加轻量高效。在VisDrone2019数据集上进行测试,实验结果表明,相较于YOLOv8s,虽然提出的SCE-YOLO算法mAP50减少0.5个百分点,但参数量和计算量仅为YOLOv8s的10.0%和48.8%,在检测精度和轻量化方面相较于其他先进算法具有明显的优势。

关键词: 目标检测, YOLOv8, 多尺度特征, 特征聚合, 轻量化

Abstract: Aiming at the problems of complicated calculation and poor detection effect of target detection model in UAV aerial photography scenes, this paper proposes a lightweight UAV target detection algorithm SCE-YOLO based on improved YOLOv8. Firstly, STA_C2f is used to replace the C2f module in the backbone network to improve the feature extraction capability of the model. Secondly, the improved AIFI module using the progressive reparameterization method is used as the spatial pyramid pooling layer to achieve high-quality scale feature interaction. Thirdly, a multi-scale feature aggregation diffusion network UAV_CFDPN is proposed, which optimizes the network structure according to the scale characteristics of small aerial targets, and designs the feature aggregation module FAM and a new feature aggregation and diffusion path, so that the model can obtain rich multi-scale features and context information, and improve the scale adaptability of target detection. Finally, an efficient shared convolution module ES-Head is designed, which makes the model more lightweight and efficient while maintaining the localization and classification capabilities. Experimental results on the VisDrone2019 dataset show that compared with YOLOv8s, although the mAP50 of the SCE-YOLO algorithm proposed in this paper is reduced by 0.5 percentage points, the number of parameters and the amount of computation are only 10.0% and 48.8% of those of YOLOv8s, which has obvious advantages over other advanced algorithms in terms of detection accuracy and lightweighting.

Key words: object detection, YOLOv8, multi-scale feature, feature aggregation, lightweight