计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (2): 247-258.DOI: 10.3778/j.issn.1002-8331.2407-0233

• 图形图像处理 • 上一篇    下一篇

YOLO-sea:改进YOLOv7-tiny的复杂海底目标检测算法研究

李润东,曲英伟,殷丽凤,郑广海   

  1. 大连交通大学 软件学院,辽宁 大连 116028
  • 出版日期:2025-01-15 发布日期:2025-01-15

YOLO-sea:Improved Complex Undersea Target Detection Algorithm for YOLOv7-tiny

LI Rundong, QU Yingwei, YIN Lifeng, ZHENG Guanghai   

  1. School of Software, Dalian Jiaotong University, Dalian, Liaoning 116028, China
  • Online:2025-01-15 Published:2025-01-15

摘要: 海底成像质量差、分辨率低导致目标边缘模糊、识别困难,小目标的聚集又增加了漏检和误检的风险。针对这些问题,考虑到YOLOv7-tiny算法兼顾高精确度和小体积的特点,在其基础上设计了YOLO-sea网络检测算法。针对低分辨率场景小目标的特征学习不足、细粒度信息易丢失的问题,基于SPDConv(space-to-depth convolution)改进主干网络,提高低分辨率场景下密集小目标特征的提取能力。针对海底成像模糊、目标边缘识别困难的问题,设计了参数共享对比度增强注意力机制(parameter shared contrast enhanced attention,PSCEA)来优化局部细节和边缘信息的表示。基于YOLOv9的GELAN架构和DSConv(dynamic snake convolution)的思想,设计高效的聚合模块DSCELAN,轻量化同时增强对海底海参、鱼类等细长目标的聚焦能力。重构检测头,进一步提升小目标的检测效果。改进后的模型YOLO-sea算法在DUO数据集上的mAP提升了2.8个百分点,参数量减少了41%,证明了该创新在海底检测方面的优势。在主流网络YOLOv5s、YOLOv7-tiny和YOLOv8n上均进行注意力对比实验,加入PSCEA机制后使mAP分别提高了1.1、1.3和0.7个百分点,证明了该机制的泛化性和有效性。

关键词: 海底检测, YOLO, GELAN, DSConv, 深度学习

Abstract: The poor quality and low resolution of seabed imaging lead to blurred target edges and difficulty in identification. The aggregation of small targets increases the risk of missed detection and false detection. To address these problems, the YOLO-sea network detection algorithm is designed based on the YOLOv7-tiny algorithm, which combines high accuracy with small size. Firstly, to address the problems of insufficient feature learning of small targets in low-resolution scenes and easy loss of fine-grained information, the backbone network is redesigned based on SPDConv (space-to-depth convolution) to improve the ability to extract features of dense small targets in low-resolution scenes. Secondly, to address the problems of blurred seabed imaging and difficulty in identifying target edges, a parameter-shared contrast enhanced attention mechanism (PSCEA) is designed to optimize the representation of local details and edge information. Thirdly, based on the GELAN architecture of YOLOv9 and the idea of DSConv (dynamic snake convolution), an efficient aggregation module DSCELAN (DSC-GELAN) is designed to reduce the weight while enhancing the focusing ability on slender targets such as sea cucumbers and fish on the seabed. Finally, the detection head is reconstructed to further improve the detection effect of small targets. The improved model YOLO-sea algorithm has improved mAP by 2.8 percentage points on the DUO dataset and has reduced the number of parameters by 41%, proving the advantages of this innovation in seabed detection. In addition, attention comparison experiments are conducted on the mainstream networks YOLOv5s, YOLOv7-tiny and YOLOv8n. After adding the PSCEA mechanism, the mAP has increased by 1.1, 1.3 and 0.7 percentage points respectively, proving the generalization and effectiveness of the mechanism.

Key words: seabed resource monitoring, YOLO, GELAN, DSConv, deep learning