特征金字塔融合的多模态行人检测算法

doi:10.3778/j.issn.1002-8331.1812-0352

计算机工程与应用 ›› 2019, Vol. 55 ›› Issue (19): 214-222.DOI: 10.3778/j.issn.1002-8331.1812-0352

特征金字塔融合的多模态行人检测算法

童靖然，毛力，孙俊

江南大学江苏省模式识别与计算智能工程实验室，江苏无锡 214122

出版日期:2019-10-01 发布日期:2019-09-30

Multimodal Pedestrian Detection Algorithm Based on Fusion Feature Pyramids

TONG Jingran, MAO Li, SUN Jun

Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence, Jiangnan University, Wuxi, Jiangsu 214122, China

Online:2019-10-01 Published:2019-09-30

摘要/Abstract

摘要： 针对单模态行人检测在光照条件较差、目标部分遮挡、目标多尺度时检测效果较差的问题，提出了一种基于可见和红外双模态特征金字塔融合的行人检测算法。使用深度卷积神经网络代替传统的手工设计特征方式分别自动从可见模态及红外热模态的图片中提取单模态特征，根据ResNet（Residual Net）的阶段性特征图谱搭建特征金字塔网络，生成每个模态的特征金字塔，并将两个模态的特征金字塔进行逐层融合。选择深度学习通用目标检测算法——Faster R-CNN作为后续的目标定位与分类算法来解决多模态行人检测问题。在特征金字塔融合阶段，针对级联融合和较大值融合容易忽略弱特征，无法有效融合互补特征的问题，提出了一种锐化特征的特征金字塔融合方法，根据阈值强化突出强特征，互补叠加弱特征，有效利用每个模态的特征，进一步提高模型的检测效果。实验结果表明，特征金字塔聚合的多模态行人检测算法可以有效解决多模态行人检测问题，在KAIST数据集上的检测效果超过了目前该数据集上的最佳模型。

关键词: 行人检测, 多模态, 特征金字塔, 特征融合

Abstract: To solve the problems of poor pedestrian detection performance in a single modal due to poor lighting conditions, partial target occlusion and multi-scale target, this paper proposes a multimodal pedestrian detection algorithm based on the fusion of visible and infrared feature pyramids. It uses the deep convolutional neural networks to replace the traditional manual design features, and automatically extracts the features from visible and infrared images. According to the periodic feature maps of ResNet（Residual Net）, a feature pyramid network is built to generate the feature pyramid of each mode. The feature pyramids of each modal are fused layer by layer to create the fusion feature pyramid. It chooses the faster R-CNN algorithm do the following target location and classification algorithm to solve the multispectral pedestrian detection. In addition, in order to solve the problem of ignoring weak features and not effectively integrating complementary features in concatenation fusion and max fusion, the paper proposes a new feature pyramid fusion method. It highlights the strong features and complements the weak features by threshold, effectively utilizes the features of each mode. The multimodal pedestrian detection algorithm based on the fusion of visible and infrared feature pyramids can effectively solve the multimodal pedestrian detection problem, and outperforms state-of-art multimodal pedestrian detectors on the KAIST dataset benchmark.

Key words: pedestrian detection, multimodal, feature pyramid, feature fusion

童靖然，毛力，孙俊. 特征金字塔融合的多模态行人检测算法[J]. 计算机工程与应用, 2019, 55(19): 214-222.

TONG Jingran, MAO Li, SUN Jun. Multimodal Pedestrian Detection Algorithm Based on Fusion Feature Pyramids[J]. Computer Engineering and Applications, 2019, 55(19): 214-222.

[1]	陆莉霞，邹俊忠，郭玉成，张见，王蓓. 多模态融合的膝关节损伤预测[J]. 计算机工程与应用, 2021, 57(9): 225-232.
[2]	董旭彬，赵清华. 改进Mask R-CNN在航空影像目标检测的研究应用[J]. 计算机工程与应用, 2021, 57(8): 133-144.
[3]	王玲，王家沛，王鹏，孙爽滋. 融合注意力机制的孪生网络目标跟踪算法研究[J]. 计算机工程与应用, 2021, 57(8): 169-174.
[4]	李明山，韩清鹏，张天宇，王道累. 改进SSD的安全帽检测方法[J]. 计算机工程与应用, 2021, 57(8): 192-197.
[5]	郭晓静，隋昊达. 改进YOLOv3在机场跑道异物目标检测中的应用[J]. 计算机工程与应用, 2021, 57(8): 249-255.
[6]	唐国智，李顶根. 深度学习及时空约束的行人跟踪算法研究[J]. 计算机工程与应用, 2021, 57(7): 121-129.
[7]	沈新烽，姜平，周根荣. 改进SSD算法在零部件检测中的应用研究[J]. 计算机工程与应用, 2021, 57(7): 257-262.
[8]	金旺，易国洪，洪汉玉，陈思媛. 基于卷积神经网络的实时车辆检测[J]. 计算机工程与应用, 2021, 57(5): 222-228.
[9]	韩文静，罗晓曙，杨日星. 一种复合型手势识别方法研究[J]. 计算机工程与应用, 2021, 57(4): 108-113.
[10]	赵辉，李志伟，方禄发. 特征信息增强的单发多框检测器算法[J]. 计算机工程与应用, 2021, 57(4): 148-154.
[11]	李佐龙，王帮海，卢增. 多尺度特征融合重建的行人检测方法[J]. 计算机工程与应用, 2021, 57(4): 176-182.
[12]	王殿伟，赵梦影，刘颖，宋海军，谢永军. 改进的R-SSD全景视频图像车辆检测算法[J]. 计算机工程与应用, 2021, 57(3): 189-195.
[13]	卢苇，刘丹，邵敏，吴扬东. 改进Mask R-CNN网络在医学图像识别与分割中的应用[J]. 计算机工程与应用, 2021, 57(24): 234-241.
[14]	肖瑞雪，冯英伟，屈建萍. 结合高效特征融合的可变尺寸图像隐写分析[J]. 计算机工程与应用, 2021, 57(24): 126-134.
[15]	杨锶齐，易尧华，汤梓伟，王新宇. 嵌入注意力机制的自然场景文本检测方法[J]. 计算机工程与应用, 2021, 57(24): 185-191.

特征金字塔融合的多模态行人检测算法

Multimodal Pedestrian Detection Algorithm Based on Fusion Feature Pyramids

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics