计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (10): 267-278.DOI: 10.3778/j.issn.1002-8331.2407-0401

• 图形图像处理 • 上一篇    下一篇

双向多尺度特征融合的无人机检测算法

汤栎,贾渊,张玉宁   

  1. 西南科技大学 计算机科学与技术学院,四川 绵阳 621010
  • 出版日期:2025-05-15 发布日期:2025-05-15

UAV Detection Algorithm Based on Bidirectional Multi-Scale Feature Fusion

TANG Yue, JIA Yuan, ZHANG Yuning   

  1. College of Computer Science and Technology, Southwest University of Science and Technology, Mianyang, Sichuan 621010, China
  • Online:2025-05-15 Published:2025-05-15

摘要: 针对在公园、学校、机场等公共区域的复杂背景中,“黑飞”无人机目标尺度多变、模糊和遮挡等导致不易识别的问题,提出一种改进YOLOv8n的无人机目标检测算法。通过在C2f模块中融合RepViTBlock结构和高效多尺度注意力机制EMA(efficient multi-scale attention)改进Bottleneck块,设计了C2f-RVB模块,在减少参数的情况下增强模型对多尺度特征信息的提取能力;在颈部特征融合网络中构建动态边界融合模块DBFFPN(dynamic boundary fusion feature pyramid network),并新增小目标检测层,聚合浅层的边界信息和深层的语义信息,提高模型抗遮挡检测能力;在损失函数部分,提出MFShape-IoU替换原模型CIoU,使得模型更关注边界框自身形状和尺度信息,聚集困难样本,提高目标定位精度。在公开数据集CBD上进行实验,结果表明,改进后的算法相比YOLOv8n,mAP@0.5提升4.1个百分点达到93.0%,mAP@0.5:0.95提升4.2个百分点达到57.1%。同时算法精度高于YOLOv8s,复杂度远低于YOLOv8s,符合在移动设备上部署的需求。

关键词: 复杂背景, 反无人机, 小目标检测, YOLOv8, 注意力机制

Abstract: A UAV target detection algorithm with improved YOLOv8n is proposed to address the difficulty of recognizing “black-flying” UAV targets in complex environments such as parks, schools, and airports, where targets vary in scale, are blurred, and occluded. The C2f-RVB module is designed by fusing the RepViTBlock structure and efficient multi-scale attention (EMA) to improve the Bottleneck block in the C2f module, enhancing the multi-scale feature extraction capabilities of the model with fewer parameters. Furthermore, a dynamic boundary fusion module DBFFPN (dynamic boundary fusion feature pyramid network) is constructed in the neck feature fusion network, and a new small target detection layer is added to aggregate shallow boundary information and deep semantic information, thereby enhancing the ability of the model to detect obscuration. In the loss function section, MFShape-IoU is proposed as a replacement for the original model CIoU, encourages the model to focus on the shape and scale of bounding box, while aggregating challenging samples, ultimately improving target localization accuracy. Experiments are conducted on the public dataset CBD, and the results demonstrate that the enhanced algorithm exhibits improvements ranging from 4.1 percentage points to 93.0% on mAP@0.5, and 4.2 percentage points to 57.1% on mAP@0.5:0.95 in comparison to YOLOv8n. Meanwhile, the algorithm exhibits higher accuracy compared to YOLOv8s and significantly lower complexity, meets the needs of deployment on mobile devices.

Key words: complex background, anti-drone, small target detection, YOLOv8, attention mechanism