计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (15): 363-372.DOI: 10.3778/j.issn.1002-8331.2404-0465

• 工程与应用 • 上一篇    下一篇

基于表观特征增强及融合的安全帽检测算法

杨国亮,洪鑫芳,盛杨杨,熊文楷   

  1. 江西理工大学 电气工程与自动化学院,江西 赣州 341000
  • 出版日期:2025-08-01 发布日期:2025-07-31

Helmet Detection Algorithm Based on Appearance Feature Enhancement and Fusion

YANG Guoliang, HONG Xinfang, SHENG Yangyang, XIONG Wenkai   

  1. School of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou, Jiangxi 341000, China
  • Online:2025-08-01 Published:2025-07-31

摘要: 佩戴安全帽是保障工人生命安全的重要措施之一。针对安全帽佩戴检测网络难以准确定位和识别与环境颜色相似的安全帽和被遮挡的安全帽问题,提出一种基于表观特征增强及融合的安全帽检测算法。设计表观特征增强模块(appearance feature enhancement module,AFEM)来构建主干特征提取网络。该模块通过并行使用浅层分支、倒置瓶颈结构分支和非对称卷积分支,实现了梯度向不同分支的传播,进而捕捉更多丰富的形状、边缘和纹理等外观特征;此外,提出双尺度特征融合模块(dual-scale feature fusion,DSFF)。该模块通过融合更细粒度的浅层特征图和特征金字塔中深层特征图来获取和强化被遮挡目标的局部表观特征信息,从而提高残缺目标的检测精度。实验结果表明,在SHWD数据集上,改进后的模型在保持实时性同时,召回率和精度分别达到88.3%和92.6%,较原模型提高了0.8和1.2个百分点。在检测不同场景下的安全帽时表现更为出色,能够更好地适应各种复杂多样场景。

关键词: 安全帽检测, YOLOv8, 特征提取, 特征融合

Abstract: Wearing safety helmets is one of the important measures to ensure the life and safety of workers. Aiming at the problem that the safety helmet wearing detection network is difficult to accurately locate and identify the safety helmet with similar color to the environment and the occluded safety helmet, this paper proposes a safety helmet detection algorithm based on appearance feature enhancement and fusion. The appearance feature enhancement module (AFEM) is designed to construct the backbone feature extraction network. By using shallow branches, inverted bottleneck structure branches and asymmetric convolution branches in parallel, the module realizes the propagation of gradient to different branches, thereby capturing more rich appearance features such as shape, edge and texture. In addition, a dual-scale feature fusion module (DSFF) is proposed. The module obtains and strengthens the local appearance feature information of the occluded target by fusing the finer-grained shallow feature map and the deep feature map in the feature pyramid, so as to improve the detection accuracy of the incomplete target. Experimental results show that on the SHWD dataset, the improved model achieves 88.3%recall and 92.6% precision while maintaining real-time performance, which is 0.8 and 1.2 percentage points higher than the original model. It performs better in detecting safety helmets in different scenes, and can better adapt to various complex and diverse scenes.

Key words: safety helmet detection, YOLOv8, feature extraction, feature fusion