计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (14): 112-122.DOI: 10.3778/j.issn.1002-8331.2412-0303

• 目标检测专题 • 上一篇    下一篇

YOLO-SWR:无人机视角下轻量级交通车辆检测算法

王泉,叶广飞,陈祺东   

  1. 1.南京信息工程大学 计算机学院,南京 210014 
    2.无锡学院 物联网工程学院,江苏 无锡 214105
  • 出版日期:2025-07-15 发布日期:2025-07-15

YOLO-SWR: Lightweight Traffic Vehicle Detection Algorithm from UAV Perspective

WANG Quan, YE Guangfei, CHEN Qidong   

  1. 1.School of Computer Science, Nanjing University of Information Engineering, Nanjing 210014, China
    2.School of Internet of Things Engineering, Wuxi University, Wuxi, Jiangsu 214105, China
  • Online:2025-07-15 Published:2025-07-15

摘要: 在智能交通系统中,无人机视角的车辆检测因其灵活性和高效性,受到越来越多的关注。无人机的车辆检测面临目标尺寸小、尺度变化大、复杂背景干扰、计算资源有限等问题,如何在有限的计算资源下提升检测精度是一个重要的挑战。针对以上问题,提出了一种精度更高且轻量的车辆检测模型YOLO-SWR。在保持三尺度检测的前提下增加了针对小目标的检测层,充分提取小目标的位置信息和细节特征,同时采用共享轻量检测头,减少网络参数量和计算量;在骨干和颈部网络中使用小波变换替代传统卷积,利用低频和高频分量提取多尺度特征;在C3k2模块中集成RDS模块,加强从网络高层的可扩展感受野中提取特征,并通过残差结构和深度可分离卷积融合浅层特征和深层特征,同时缓解梯度消失问题;采用Soft-NMS策略优化目标框筛选过程,提升细节特征的处理精度。实验结果表明,基于筛选后的VisDrone2019数据集,提出的模型YOLO-SWR比基础模型YOLOv11n,在mAP方面提升了9.6个百分点,同时参数量减少了56%。因此,提出的模型YOLO-SWR在精度与轻量化方面均具有显著优势。

关键词: 小目标检测, 轻量化, 交通车辆, 目标跟踪, 无人机视角

Abstract: In intelligent transportation systems, vehicle detection from the perspective of unmanned aerial vehicle (UAV) has attracted more and more attention due to its flexibility and efficiency. Vehicle detection by UAVs faces problems such as small target size, large scale variation, complex background interference, and limited computational resources. How to improve the detection accuracy with limited computing resources is an important challenge. To solve the above problems, this paper proposes YOLO-SWR, a vehicle detection model with higher accuracy and lightweightness. Firstly, the model adds a detection layer for small objects, maintains the three-scale detection to fully extract the location information and detailed features of small objects, and adopts a shared lightweight detection head to reduce the number of network parameters and computational complexity. Secondly, wavelet pooling is used in the backbone and neck networks instead of traditional convolutions, leveraging both low-frequency and high-frequency components to improve multi-scale feature extraction. Furthermore, the RDS module is integrated into the C3k2 module to strengthen the extraction of features from the scalable receptive field at the high-level of the network, and the shallow features and deep features are fused through the residual structure and depthwise separable convolution, while alleviating the gradient disappearance problem. Finally, the Soft-NMS strategy is used to optimize the bounding box selection process to improve the processing precision of detailed features. Experimental results show that based on the filtered VisDrone2019 dataset, the proposed model YOLO-SWR improves mAP by 9.6 percentage points and reduces the number of parameters by 56%, compared with the base model YOLOv11n. Therefore, the YOLO-SWR model proposed in this paper has significant advantages in both accuracy and lightweightness.

Key words: small object detection, lightweight, traffic vehicle, object tracking, UAV perspective