计算机工程与应用 ›› 2026, Vol. 62 ›› Issue (7): 131-142.DOI: 10.3778/j.issn.1002-8331.2509-0101

• YOLO改进及应用专题 • 上一篇    下一篇

改进YOLOv13的红外遥感小目标检测算法

李平+,陈继锋   

  1. 湖南涉外经济学院 信息与机电工程学院,长沙 410205
    + 通信作者 E-mail:3916383670@qq.com
  • 收稿日期:2025-09-09 修回日期:2025-12-08 在线发布日期:2026-04-01 出版日期:2026-04-01
  • 基金资助:
    湖南省教育厅科学研究重点项目(23A0659)。

Improved YOLOv13 for Infrared Remote Sensing Small Object Detection

LI Ping+, CHEN Jifeng   

  1. School of Information and Mechatronic Engineering, Hunan International Economics University, Changsha 410205, China
    + Corresponding author E-mail:3916383670@qq.com
  • Received:2025-09-09 Revised:2025-12-08 Online:2026-04-01 Published:2026-04-01

摘要: 为了有效应对红外遥感场景中小目标尺度较小、信噪比低且易被复杂背景淹没等挑战,对YOLOv13模型进行改进,实现实时性和轻量化的检测需求。通过构建多阶特征聚合模块(multi-level feature aggregation module, MFAM),自底向上汇聚不同语义深度与空间分辨率的层级信息,并自适应重标定其贡献,以缓解小目标在深层语义中被稀释的问题。设计了双路径融合金字塔网络(dual-path fusion pyramid network,DFPN),以互补的自顶向下语义增强路径与自底向上细节回流路径实现跨尺度信息循环交互,从而强化弱小热目标的可分性。提出的上下文感知融合模块(context-aware fusion block,CAFBlock)采用全局自注意力和局部深度卷积的双分支结构以协同建模长距离依赖与精细局部特征,同时结合膨胀卷积多感受野和深度卷积局部细节的双路径处理方式与门控融合机制,全面增强模型的多尺度上下文建模能力。在SIRST和HIT-UAV数据集上进行对比评估,改进模型实现了90.06%和64.37%的AP,分别提高了7.65个百分点和8.55个百分点,充分验证了模型的有效性和可行性。

关键词: 红外遥感, YOLOv13, 小目标检测, 跨尺度, 特征融合, Transformer

Abstract: To address the challenges of small object size, low signal-to-noise ratio, and susceptibility to complex background interference in infrared remote sensing, the YOLOv13 model is improved to meet the requirements of real-time and lightweight detection. A multi-level feature aggregation module (MFAM) is constructed to aggregate hierarchical information from different semantic depths and spatial resolutions in a bottom-up manner, while adaptively recalibrating their contributions to alleviate the dilution of small objects in deep semantic layers. A dual-path fusion pyramid network (DFPN) is designed, where a top-down semantic enhancement path and a bottom-up detail refinement path interact in a complementary manner to achieve cross-scale information circulation, thereby enhancing the separability of weak thermal targets. The proposed context-aware fusion block (CAFBlock) adopts a dual-branch structure of global self-attention and local depthwise convolution to jointly model long-range dependencies and fine-grained local features. In addition, it integrates a dual-path processing strategy of dilated convolution with multiple receptive fields and depthwise convolution for local details, combined with a gated fusion mechanism, to comprehensively strengthen multi-scale context modeling. Comparative experiments on the SIRST and HIT-UAV datasets demonstrate that the improved model achieves 90.06% and 64.37% AP, with relative gains of 7.65 percentage points and 8.55 percentage points, respectively, which verifies the effectiveness and feasibility of the proposed approach.

Key words: infrared remote sensing, YOLOv13, small object detection, cross-scale, feature fusion, Transformer