LMUAV-YOLOv8: Lightweight Network for Object Detection in Low-Altitude UAV Vision

doi:10.3778/j.issn.1002-8331.2407-0127

Abstract

Abstract: To address the weak perception of small-scale objects and the high missed-detection rate that low-altitude UAVs face in complex traffic scenarios, the LMUAV-YOLOv8 algorithm is proposed. Its effectiveness and advantages are verified through ablation and comparative experiments, and its internal mechanisms are visualized using class activation mapping. First, a lightweight feature fusion network (UAV_RepGFPN) is introduced, with new feature fusion paths and a feature fusion module, DBB_GELAN, which reduce the number of parameters and the computational cost while improving the performance of the feature fusion network. Second, a feature extraction module (FTA_C2f) is constructed using partial convolution (PConv) and the triplet attention mechanism (Triplet Attention), and the ADown down-sampling module is introduced. By rearranging the dimensions of the input feature maps and making fine-grained adjustments, the ability of the deep network to capture spatial features is enhanced, and the number of parameters and the computational cost are further reduced. Third, to counter the large information loss that occurs during layer-by-layer feature extraction and spatial transformation, a new context-guided programmable gradient information (UAV_PGI) strategy is proposed. By designing a context-guided reversible architecture and three additional auxiliary detection heads, UAV_PGI significantly enhances detection capability for aerial objects. To verify the validity and generalization ability of the model, comparative experiments are carried out on the VisDrone 2019 test set. Compared with YOLOv8s, LMUAV-YOLOv8s improves precision, recall, mAP@0.5, and mAP@0.5:0.95 by 4.2, 3.9, 5.1, and 3.0 percentage points, respectively, while the computational cost increases by only 0.4 GFLOPs and the parameter count is reduced by 63.9%, striking a good balance between performance and cost.
Inference experiments on the NVIDIA Jetson Xavier NX embedded platform show that, compared with the baseline model, the proposed algorithm achieves higher detection accuracy while meeting real-time detection requirements, making it more suitable for real-time object detection on drones. Finally, the decision-making process is visualized using class activation mapping, which provides an intuitive way to understand the internal mechanisms of the network. The results show that the proposed model has superior small-scale feature extraction and high-resolution processing capabilities.
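The parameter savings attributed to partial convolution can be illustrated with a rough weight count. The sketch below is not the authors' FTA_C2f implementation. It assumes the common PConv setting in which a k×k convolution is applied to only a fraction of the channels (one quarter here, an assumed ratio) while the remaining channels pass through unchanged. The function names and the channel width of 256 are hypothetical and chosen only for illustration.

```python
def conv_params(in_channels: int, out_channels: int, kernel_size: int) -> int:
    """Weight count of a standard, bias-free 2D convolution."""
    return in_channels * out_channels * kernel_size * kernel_size


def pconv_params(channels: int, kernel_size: int, n_div: int = 4) -> int:
    """Weight count of a partial convolution.

    Only channels // n_div channels are convolved; the rest are passed
    through untouched and therefore contribute no weights.
    n_div = 4 is an assumed split ratio, not a value taken from the paper.
    """
    conv_channels = channels // n_div
    return conv_params(conv_channels, conv_channels, kernel_size)


if __name__ == "__main__":
    channels, kernel_size = 256, 3  # hypothetical layer width and kernel
    full = conv_params(channels, channels, kernel_size)
    partial = pconv_params(channels, kernel_size)
    print(f"standard conv weights: {full}")
    print(f"partial conv weights:  {partial}")
    print(f"ratio: {partial / full:.4f}")
```

Under these assumptions the spatial convolution carries one sixteenth of the weights of a full convolution at the same width. Pointwise layers and attention branches that surround it in a real block are not counted here.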

Key words: small object detection, multi-scale, lightweight, YOLOv8, programmable gradient information

摘要： 针对低空无人机目标检测面临目标尺度变化大、小目标容易漏检和误检的挑战，发展了一种融合多尺度特征的目标检测轻量化网络（LMUAV-YOLOv8），通过开展消融和对比实验，验证了算法的有效性和先进性，并借助类激活图，对模型的决策过程进行了解释。设计了一种轻量化的特征融合网络（UAV_RepGFPN），提出新的特征融合路径以及特征融合模块DBB_GELAN，降低参数量和计算量的同时，提高特征融合网络的性能。使用部分卷积（PConv）和三重注意力机制（Triplet Attention）构建特征提取模块（FTA_C2f），并引入ADown下采样模块，通过对输入特征图维度的重新排列和细粒度调整，以提升模型中深层网络对空间特征的捕捉能力，并进一步降低参数量和计算量。优化YOLOv9的可编程梯度信息（programmable gradient information，PGI）策略，设计基于上下文引导（Context_guided）的可逆架构，并额外生成三个辅助检测头，提出UAV_PGI可编程梯度方法，避免传统深度监督中多路径特征集成可能导致的语义信息损失。为了验证模型的有效性及泛化能力，在VisDrone 2019测试集上开展了对比实验，结果显示，与YOLOv8s相比，LMUAV-YOLOv8s的准确度、召回率、mAP@0.5和mAP@0.5:0.95等指标分别提升了4.2、3.9、5.1和3.0个百分点，同时参数量减少了63.9%，计算量仅增加0.4 GFLOPs，实现了检测性能与资源消耗的良好平衡。基于NVIDIA Jetson Xavier NX嵌入式平台的推理实验结果显示：与基线模型相比，该算法能够在满足实时检测要求的条件下，获得更高的检测精度，对于无人机实时目标检测场景具有较好的适用性。借助类激活图，对算法的决策过程进行了可视化分析，结果表明，该模型具备更优异的小尺度特征提取和高分辨率处理能力。

关键词: 小目标检测, 多尺度, 轻量化, YOLOv8, 可编程梯度信息

DONG Yibing, ZENG Hui, HOU Shaojie. LMUAV-YOLOv8: Lightweight Network for Object Detection in Low-Altitude UAV Vision[J]. Computer Engineering and Applications, 2025, 61(3): 94-110.

董一兵, 曾辉, 侯少杰. LMUAV-YOLOv8：低空无人机视觉目标检测轻量化网络[J]. 计算机工程与应用, 2025, 61(3): 94-110.
