计算机工程与应用 ›› 2023, Vol. 59 ›› Issue (19): 166-176.DOI: 10.3778/j.issn.1002-8331.2205-0530

• 图形图像处理 • 上一篇    下一篇

基于注意力增强的行人与头肩级联检测算法

庄淑青,张晓伟,曹帅,宋明晨   

  1. 1.青岛大学 计算机科学技术学院,山东 青岛 266071
    2.海信研究发展中心虚拟现实部,山东 青岛 266071
  • 出版日期:2023-10-01 发布日期:2023-10-01

Pedestrian and Head-Shoulders Cascade Detection Algorithm Based on Attention Enhancement

ZHUANG Shuqing, ZHANG Xiaowei, CAO Shuai, SONG Mingchen   

  1. 1.School of Computer Science and Technology, Qingdao University, Qingdao, Shandong 266071, China
    2.Virtual Reality Department of Hisense Research and Development Center, Qingdao, Shandong 266071, China
  • Online:2023-10-01 Published:2023-10-01

摘要: 在监控视频中,行人会存在不同视角、不同程度的遮挡问题,导致当前行人检测器漏检率、错检率较高,于是提出了一种注意力增强下的行人整体与行人头肩区域级联检测的行人检测器。提出一种新的通道注意力机制,称为全卷积通道注意力机制;针对分类和回归任务分别融入相适应的注意力机制,来增强有效的检测特征,抑制背景特征信息;设计行人整体与行人头肩区域级联行人检测器,通过行人整体与行人头肩区域的匹配算法,级联地处理检测结果。该算法,尤其针对下半身严重遮挡的情况,极大降低了遮挡行人的漏检率。实验结果表明,在Caltech公开行人检测测试数据集Reasonable(合理子集)的对数平均漏检率降低到5.37%,尤其在Occ=heavy(严重遮挡子集)上的对数平均漏检率降低到23.33%,同时在ETH和CityPersons行人检测数据集上,该算法亦拥有较好的检测效果。

关键词: 遮挡行人检测, 注意力机制, 头肩区域检测分支, 级联检测器

Abstract: In the surveillance video, pedestrians may have different perspectives and different degrees of occlusion, resulting in a high rate of missed detection and error detection of the current pedestrian detector. Therefore, a pedestrian detector is proposed for cascading detection of the whole pedestrian and the head and shoulder area of the pedestrian with enhanced attention. Firstly, a new channel attention mechanism called fully convolutional channel attention mechanism is proposed. Secondly, channel attention mechanism and spatial attention mechanism are added respectively for classification and regression tasks to enhance effective detection features and suppress background features. Finally, a cascade pedestrian detector is designed for the pedestrian as a whole and the pedestrian head and shoulder area, and the detection results are cascaded through the matching algorithm of the pedestrian as a whole and the pedestrian head and shoulder area. This algorithm also can greatly weaken the missed detection rate, particularly in the case of serious occlusion of the lower body. Experimental results indicate that the log-average missed rate of Reasonable(reasonable subset) pedestrian detection test dataset in Caltech is reduced to 5.37%, and the log-average missed rate of Occ=heavy(severely obscured subset) is reduced to 23.33%. At the same time, the algorithm also has good detection performance on ETH and CityPersons pedestrian detection datasets.

Key words: blocking pedestrian detection, attention mechanism, head-shoulder area detection branch, cascade detector