Pedestrian Detection Method Based on Multi-scale Feature Fusion and Reconstruction

doi:10.3778/j.issn.1002-8331.1912-0174

Computer Engineering and Applications, 2021, Vol. 57, Issue (4): 176-182.

LI Zuolong (李佐龙), WANG Banghai (王帮海), LU Zeng (卢增)

School of Computer, Guangdong University of Technology, Guangzhou 510006, China

Online: 2021-02-15 Published: 2021-02-06

Abstract:

Pedestrians appear at widely varying scales in many scenes, which seriously degrades detector accuracy. To address this, two modules are designed, convolutional feature reconstruction and channel attention, to improve the detection of multi-scale pedestrians. Starting from the multi-scale features of the original input, multiple feature pyramids are built by fusion and reconstruction. Features of the same scale across these pyramids are then fused, and a channel attention weight is learned for each feature layer to increase the weight of the effective channels; only the resulting features are used for the final detection. The two modules are integrated into the RFBnet model, and the model's loss function is modified to improve the detection of occluded pedestrians. Test results on the Caltech-USA, INRIA and ETH datasets show that the new method is more accurate than RFBnet, MS-CNN and some other multi-scale methods, and that it achieves the best detection results on the test subsets of pedestrians at different scales.

Key words: pedestrian detection, convolutional neural network, multi-scale feature, occlusion handling
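
The abstract names the two modules, but this page does not describe their internals. As a rough illustration only, the sketch below assumes that same-scale features are fused by channel concatenation and that channel attention follows a squeeze-and-excitation pattern (global average pooling, a small two-layer network, sigmoid gates). The function names, the fusion rule and the weight shapes are all assumptions made for this example, not details taken from the paper; in a trained model the weights `w1`, `b1`, `w2` and `b2` would be learned.

```python
import math


def concat_same_scale(feature_maps):
    """Fuse same-scale maps from several pyramids by stacking their channels.

    Each map is a list of channels; each channel is a list of rows (H x W).
    Channel concatenation is an assumed fusion rule, not the paper's.
    """
    height = len(feature_maps[0][0])
    width = len(feature_maps[0][0][0])
    fused = []
    for fmap in feature_maps:
        for channel in fmap:
            if len(channel) != height or any(len(row) != width for row in channel):
                raise ValueError("all maps must share one spatial size")
            fused.append(channel)
    return fused


def global_avg_pool(fmap):
    """Squeeze step: reduce every channel to its mean activation."""
    return [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in fmap]


def dense(vec, weights, bias):
    """Fully connected layer: one output per row of `weights`."""
    return [sum(w * v for w, v in zip(row, vec)) + b for row, b in zip(weights, bias)]


def channel_attention(fmap, w1, b1, w2, b2):
    """Return the reweighted map and the per-channel gates (between 0 and 1)."""
    squeezed = global_avg_pool(fmap)
    hidden = [max(0.0, h) for h in dense(squeezed, w1, b1)]              # ReLU
    gates = [1.0 / (1.0 + math.exp(-z)) for z in dense(hidden, w2, b2)]  # sigmoid
    scaled = [[[g * x for x in row] for row in ch] for ch, g in zip(fmap, gates)]
    return scaled, gates
```

With all-zero weights every gate is sigmoid(0) = 0.5, so each channel is simply halved; learned weights would instead push the gates toward 1 for informative channels and toward 0 for the rest.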

LI Zuolong, WANG Banghai, LU Zeng. Pedestrian Detection Method Based on Multi-scale Feature Fusion and Reconstruction[J]. Computer Engineering and Applications, 2021, 57(4): 176-182.
