基于深度卷积神经网络的道路场景理解

doi:10.3778/j.issn.1002-8331.1708-0195

计算机工程与应用 ›› 2017, Vol. 53 ›› Issue (22): 8-15.DOI: 10.3778/j.issn.1002-8331.1708-0195

基于深度卷积神经网络的道路场景理解

吴宗胜1，2，傅卫平2，韩改宁1

1.咸阳师范学院计算机学院，陕西咸阳 712000
2.西安理工大学机械与精密仪器工程学院，西安 710048

出版日期:2017-11-15 发布日期:2017-11-29

Road scene understanding based on deep convolutional neural network

WU Zongsheng1，2, FU Weiping2, HAN Gaining1

1.School of Computer, Xianyang Normal University, Xianyang, Shaanxi 712000, China
2.School of Mechanical and Precision Instrumental Engineering, Xi’an University of Technology, Xi’an 710048, China

Online:2017-11-15 Published:2017-11-29

摘要/Abstract

摘要： 在无人驾驶技术中，道路场景的理解是一个非常重要的环境感知任务，也是一个很具有挑战性的课题。提出了一个深层的道路场景分割网络（Road Scene Segmentation Network，RSSNet），该网络为32层的全卷积神经网络，由卷积编码网络和反卷积解码网络组成。网络中采用批正则化层防止了深度网络在训练中容易出现的“梯度消失”问题；在激活层中采用了Maxout激活函数，进一步缓解了梯度消失，避免网络陷入饱和模式以及出现神经元死亡现象；同时在网络中适当使用Dropout操作，防止了模型出现过拟合现象；编码网络存储了特征图的最大池化索引并在解码网络中使用它们，保留了重要的边缘信息。实验证明，该网络能够大大提高训练效率和分割精度，有效识别道路场景图像中各像素的类别并对目标进行平滑分割，为无人驾驶汽车提供有价值的道路环境信息。

关键词: 深度学习, 卷积神经网络, 场景理解, 语义分割

Abstract: In the self-driving technology, the road scene understanding is a very important task for environment perception, and it is a challenging topic. In this paper, a deep Road Scene Segmentation Network（RSSNet） is presented, which is a 32-layer full convolutional network composed of convolution encoded network and deconvolution decoded network. The batch normalization layer used in the RSSNet prevents the vanishing gradient problem from appearing during the training process; the activation layer using the Maxout function further weakens the vanishing gradient and avoids the network falling into a saturated mode and neuron death phenomenon; moreover, the RSSNet using dropout operation prevents the over-fitting phenomenon of the network model; the max-pool indices of the feature map saved by the encoded-network are used in the decoded-network to upsample the feature map, which keeps the important edge information down. The experimental results show that the RSSNet can greatly improve the training efficiency and the segmentation accuracy, effectively classify each pixel in the road scene image and smoothly segment the objects, and provide useful information of road environment for driverless cars.

Key words: deep learning, convolutional neural network, scenes understanding, semantic segmentation

吴宗胜1，2，傅卫平2，韩改宁1. 基于深度卷积神经网络的道路场景理解[J]. 计算机工程与应用, 2017, 53(22): 8-15.

WU Zongsheng1，2, FU Weiping2, HAN Gaining1. Road scene understanding based on deep convolutional neural network[J]. Computer Engineering and Applications, 2017, 53(22): 8-15.

[1]	牟清萍，张莹，张东波，王新杰，杨知桥. 目标丢失判别机制的视觉跟踪算法及应用研究[J]. 计算机工程与应用, 2021, 57(9): 140-147.
[2]	包志强，邢瑜，吕少卿，黄琼丹. 改进YOLO V2的6D目标姿态估计算法[J]. 计算机工程与应用, 2021, 57(9): 148-153.
[3]	黄冬宜，杨兵，吴子豪，匡佳一，颜泽明. 用于全市蜂窝流量预测的时空全连接卷积网络[J]. 计算机工程与应用, 2021, 57(9): 168-175.
[4]	赵志焱，杨华，胡志伟，宇海萍. 基于TACNN的玉露香梨叶虫害识别[J]. 计算机工程与应用, 2021, 57(9): 176-181.
[5]	周伦钢，孙怡峰，王坤，吴疆，黄维贵，李炳龙. 目标多种多值属性的端端快速识别网络[J]. 计算机工程与应用, 2021, 57(9): 182-190.
[6]	张成，戴俊峰，熊闻心. 融合LeNet-5改进的扫描文档手写日期识别[J]. 计算机工程与应用, 2021, 57(9): 207-211.
[7]	麻哲旭，杨峰，乔旭. 铁路路基病害智能检测方法[J]. 计算机工程与应用, 2021, 57(9): 272-278.
[8]	武文杰，宋文爱，高雪梅，杨吉江，王青，黄丽萍，雷毅. 基于X线的成人OSA计算机辅助诊断综述[J]. 计算机工程与应用, 2021, 57(9): 1-8.
[9]	冉蓉，徐兴华，邱少华，崔小鹏，欧阳斌. 基于深度卷积神经网络的裂纹检测方法综述[J]. 计算机工程与应用, 2021, 57(9): 23-35.
[10]	李晓筱，胡晓光，王梓强，杜卓群. 基于深度学习的实例分割研究进展[J]. 计算机工程与应用, 2021, 57(9): 60-67.
[11]	徐少杰，曹雏清，王永娟. 视觉SLAM在室内动态场景中的应用研究[J]. 计算机工程与应用, 2021, 57(8): 175-179.
[12]	李明山，韩清鹏，张天宇，王道累. 改进SSD的安全帽检测方法[J]. 计算机工程与应用, 2021, 57(8): 192-197.
[13]	曾春艳，严康，王志锋，余琰，纪纯妹. 深度学习模型可解释性研究综述[J]. 计算机工程与应用, 2021, 57(8): 1-9.
[14]	许德刚，王露，李凡. 深度学习的典型目标检测算法研究综述[J]. 计算机工程与应用, 2021, 57(8): 10-25.
[15]	蒋斌，钟瑞，张秋闻，张焕龙. 采用深度学习方法的非正面表情识别综述[J]. 计算机工程与应用, 2021, 57(8): 48-61.

基于深度卷积神经网络的道路场景理解

Road scene understanding based on deep convolutional neural network

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics