Select
LMUAV-YOLOv8: Lightweight Network for Object Detection in Low-Altitude UAV Vision
DONG Yibing, ZENG Hui, HOU Shaojie
Computer Engineering and Applications
2025, 61 (3 ):
94-110.
DOI: 10.3778/j.issn.1002-8331.2407-0127
To tackle the challenges of weak sensing capacity and high missed detection rates for small-scale objects using low-altitude UAV in complex traffic scenarios, the LMUAV-YOLOv8 algorithm is proposed. Its efficiency and advantage are verified through ablation and comparative experiments. The internal mechanisms is visualized by using the method of class activation mapping. In this dissertation, a lightweight feature fusion network (UAV_RepGFPN) is introduced firstly, proposing new feature fusion paths and a feature fusion module DBB_GELAN, which reduces the number of parameters and computation while improving the performance of the feature fusion network. Secondly, the feature extraction module (FTA_C2f) is constructed using partial convolution (PConv) and triplet attention mechanism (Triplet Attention), and the ADown down-sampling module is introduced. By rearranging the dimensions of the input feature maps and making fine-grained adjustments, the ability of the deep network to capture spatial features is enhanced, further reducing the number of parameters and computation. Then, concerning large amount of information loss during in layer-by-layer feature extraction and spatial transformation, a new context-guided programmable gradient information (UAV_PGI) strategy is proposed. By designing a context-guided reversible architecture and three additional auxiliary detection heads, UAV_PGI significantly enhance detection capabilities for aerial objects. In order to verify the validity and generalization ability of the model, comparative experiments are carried out on the VisDrone 2019 test set, and the results show that: compared with YOLOv8s, LMUAV-YOLOv8s on the VisDrone 2019 test set improves precision, recall, mAP@0.5, and mAP@0.5:0.95 by 4.2, 3.9, 5.1, and 3.0?percentage points, separately, with the computational cost increased by only 0.4?GFLOPs and the parameter count reduced by 63.9%, meaning a good balance between performance and cost. The inference experimental results based on NVIDIA Jetson Xavier NX embedded platform show that compared with the baseline model, the proposed algorithm can obtain higher detection accuracy under the condition of meeting the requirements of real-time detection, rendering it more suitable for real-time target detection scenarios in drones. Finally, the decision making process is visualized by using the method of class activation mapping, which provides a intuitive way to understand the internal mechanisms of the networ. And the results show that the proposed model has superior small-scale feature extraction and high-resolution processing capabilities.
Reference |
Related Articles |
Metrics