计算机工程与应用 ›› 2026, Vol. 62 ›› Issue (5): 88-105.DOI: 10.3778/j.issn.1002-8331.2505-0210

• YOLOv11 改进及应用专题 • 上一篇    下一篇

PCSED-YOLO:复杂环境下跨尺度多目标穿戴检测算法研究

薛光辉1,2+,闫朝阳1,吴冕1   

  1. 1.中国矿业大学(北京) 机械与电气工程学院,北京 100083
    2.煤矿智能化与机器人创新应用应急管理部重点实验室,北京 100083
    + 通信作者 E-mail:xgh@cumtb.edu.cn
  • 收稿日期:2025-05-18 修回日期:2025-08-20 在线发布日期:2026-03-01 出版日期:2026-03-01
  • 基金资助:
    深地国家科技重大专项(2025ZD1010704);新疆维吾尔自治区科技计划(2025B01001);国家自然科学基金面上项目(51075388)。

PCSED-YOLO: Study on Cross-Scale Multi-Object Wearable Detection Algorithm in Complex Environments

XUE Guanghui1,2+, YAN Zhaoyang1, WU Mian1   

  1. 1.School of Mechanical and Electrical Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China
    2.Key Laboratory of Intelligent Mining and Robotics, Ministry of Emergency Management, Beijing 100083, China
    + Corresponding author E-mail:xgh@cumtb.edu.cn
  • Received:2025-05-18 Revised:2025-08-20 Online:2026-03-01 Published:2026-03-01

摘要: 车间工人在作业期间若未按规定穿戴安全装备,不仅可能对其健康造成影响,还可能导致伤亡等安全事故。基于此,计算机视觉的穿戴检测技术是目前研究的热点领域。然而,由于生产车间内设备繁多,环境复杂且恶劣,生产过程中产生的大量粉尘会使穿戴目标变得模糊或部分被遮挡。此外,穿戴目标的尺寸分布范围宽,属于复杂环境下跨尺度多目标检测范畴。现有的算法在检测精度方面存在不足,特别是对口罩等小目标的误检和漏检率较高。为此,提出了一种基于YOLO模型的改进目标检测算法:PCSED-YOLO。在C3k2中融合了并行补丁感知模块,以增强小目标特征提取及多尺度目标检测能力;将交叉卷积注意力融合模块嵌入C2PSA,实现局部特征感知与全局上下文信息的互补,从而提升粉尘场景中的目标识别能力;引入空间到深度卷积替代原有的卷积层,通过重组空间维度信息至通道维度,实现无损下采样,提升小目标和低分辨率目标的检测性能;融合SEv2(squeeze-and-excitation network v2),创新改进空间金字塔池化层,增强模型对复杂场景的全局上下文把控能力,提升多类别、跨尺度目标的特征提取能力;在检测头引入动态卷积Dynamic-Conv,通过动态调整卷积核的大小和形状,提升跨尺度目标检测的精度;增加更高分辨率的P2检测层,提高小目标检测精度。制备了工人穿戴数据集,并进行了消融和对比实验。实验结果显示,PCSED-YOLO算法模型在处理小目标、中目标和大目标时均表现出色,与基准模型相比,mAP@0.5达到了0.946,提升了0.077;AP@0.5mask(小目标)达到了0.887,提升了0.236;AP@0.5no-helmet(中目标)提升了0.037至0.958;AP@0.5vest(大目标)提升了0.006至0.991;F1-Score和P-R曲线指标较基准模型也有明显改善。与几种先进的检测模型相比,PCSED-YOLO模型在制备的数据集上取得了最佳的检测性能,表明该模型具有较强的复杂环境跨尺度多目标检测能力和泛化能力,为复杂环境下跨尺度多目标穿戴检测提供了新的算法方案。

关键词: 安全穿戴检测, 小目标检测, 多尺度目标检测, 深度学习, YOLO

Abstract: Non-standard safety wear by workshop workers during operations can affect physical health and lead to safety accidents, including injuries or fatalities. Computer vision-based detection of personal protective equipment has become a research focus. Workshops have complex environments with numerous devices and harsh conditions, dust generated during production blurs or obscures targets, presenting challenges for multi-target detection across scales. Existing algorithms struggle with low accuracy and high false negative rates for small targets like masks. To address this, an improved YOLO-based detection algorithm, PCSED-YOLO, is proposed. The PPA module enhances feature extraction for small targets and multi-scale detection. The CAFM module in C2PSA achieves local and global feature complementarity, enhancing recognition in dusty environments. SPD-Conv replaces original convolutional layers for lossless down-sampling, improving detection of small, low-resolution targets. The SEv2 improvement of the spatial pyramid pooling layer enhances global context control and feature extraction for multi-class, cross-scale targets. Dynamic-Conv in the detection head adjusts convolution kernel size and shape for precise cross-scale detection. A higher resolution P2 detection layer improves small target accuracy. A dataset of workers’ protective wear is compiled for ablation and comparative experiments. Results show that PCSED-YOLO excels in handling targets of various sizes, achieving a mAP@0.5 of 0.946, a 0.077 improvement over the baseline. AP@0.5mask reaches 0.887, a 0.236 improvement, AP@0.5no-helmet is increased by 0.037 to 0.958, and AP@0.5vest is improved by 0.006 to 0.991. F1-Score and P-R curve metrics show significant enhancement. Compared to advanced models, PCSED-YOLO achieves the best accuracy on the dataset, demonstrating strong capabilities in complex environments and providing a new algorithm for multi-target detection of protective wear.

Key words: safety wear detection, small object detection, multi-scale object detection, deep learning, YOLO