Computer Engineering and Applications ›› 2024, Vol. 60 ›› Issue (9): 292-298.DOI: 10.3778/j.issn.1002-8331.2302-0275

• Graphics and Image Processing • Previous Articles     Next Articles

Small Object Detection Based on Structure Perception and Global Context Information

LI Zhonghua, LIN Chujun, ZHU Hengliang, LIAO Shiyu, BAI Yunqi   

  1. College of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350118, China
  • Online:2024-05-01 Published:2024-04-29

基于结构感知和全局上下文信息的小目标检测

李钟华,林初俊,朱恒亮,廖诗宇,白云起   

  1. 福建理工大学 计算机科学与数学学院,福州 350118

Abstract: In the small targets detection task, it’s difficult to detect the small objects with limited pixels and unabundant characteristics. So it is easy to cause many problems, such as missed or failed detection, and low accuracy of the model. To?address?these?issues, this paper proposes a novel multi-scale structure perception and global contextual information network for small object detection. Firstly, a multi-scale structure perception module (MSSP) is proposed to capture the detailed features of small targets, it can enhance the ability of model to identify objects with different sizes. Secondly, in order to obtain more global features, a global context module (GCM) is introduced to extract global information and effectively establish the relationship between different pixels. Finally, a new IoU loss function, namely W-CIoU, is designed for the tiny objects detection, which can relieve the gradient explosion phenomenon caused by too few small target pixels during model training. Extensive experiments show that the proposed approach achieves higher accuracy than other classic lightweight methods. Compared with the baseline, the proposed model obtains over 6.4 percentage points mAP50 gain and 4.6 percentage points mAP50:95 gain on the VisDrone dataset, and also has a good performance on the TinyPerson dataset.

Key words: small object detection, attention mechanism, context information, loss function

摘要: 在小目标检测任务中,由于小目标像素值少、特征不丰富和难提取等局限性,容易导致模型漏检、误检以及精度低等问题,提出了一种基于多尺度结构感知和全局上下文信息的小目标检测算法。针对复杂场景设计了多尺度结构感知模块,可以更好地捕获小目标的细节特征,以此增强模型识别不同尺寸物体的检测能力。为了获取更多的全局特征,借助Transformer捕获长距离依赖特征的优势设计了全局上下文信息模块,有效地建立起不同区域像素点之间的联系。针对模型训练时的梯度爆炸现象,设计了一种新的带权重损失函数W-CIoU,使得训练时的收敛速度有明显改善。大量的实验结果表明,提出的方法相较于其他经典的轻量级方法取得了较好的检测效果。与基准模型相比,提出的模型在VisDrone数据集上mAP50和mAP50:95分别提高了6.4和4.6个百分点,同时在TinyPerson数据集上也有着不错的表现。

关键词: 小目标检测, 注意力机制, 上下文信息, 损失函数