Road scene understanding based on deep convolutional neural network

doi:10.3778/j.issn.1002-8331.1708-0195

Abstract

In self-driving technology, road scene understanding is an important and challenging environment-perception task. This paper presents a deep Road Scene Segmentation Network (RSSNet), a 32-layer fully convolutional network composed of a convolutional encoder network and a deconvolutional decoder network. Batch normalization layers in RSSNet guard against the vanishing gradient problem that deep networks tend to encounter during training. Maxout activation functions further alleviate vanishing gradients and keep the network from entering a saturated regime or suffering neuron death. Dropout, applied where appropriate, prevents over-fitting. The encoder network stores the max-pooling indices of its feature maps, and the decoder network uses them to upsample the feature maps, which preserves important edge information. Experimental results show that RSSNet greatly improves training efficiency and segmentation accuracy, effectively classifies each pixel in road scene images, segments objects smoothly, and provides useful road-environment information for driverless cars.

Keywords: deep learning, convolutional neural network, scene understanding, semantic segmentation
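The abstract names two mechanisms that are easy to show on a toy input: the Maxout activation, and upsampling in the decoder with max-pooling indices saved by the encoder. The following is a minimal pure-Python sketch of both, not the authors' implementation. The function names (`maxout`, `max_pool_with_indices`, `max_unpool`), the 2 x 2 non-overlapping window, and the 4 x 4 input are assumptions made for illustration.

```python
from typing import List, Tuple

Grid = List[List[float]]
IndexGrid = List[List[Tuple[int, int]]]


def maxout(pieces: List[float]) -> float:
    """Maxout activation: a unit outputs the largest of its linear pieces."""
    return max(pieces)


def max_pool_with_indices(x: Grid, k: int = 2) -> Tuple[Grid, IndexGrid]:
    """Non-overlapping k x k max pooling that also records where each maximum was."""
    rows, cols = len(x) // k, len(x[0]) // k
    pooled = [[0.0] * cols for _ in range(rows)]
    indices = [[(0, 0)] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            # Collect (value, position) pairs for the current k x k window.
            window = [(x[r][c], (r, c))
                      for r in range(i * k, (i + 1) * k)
                      for c in range(j * k, (j + 1) * k)]
            value, pos = max(window, key=lambda t: t[0])
            pooled[i][j] = value
            indices[i][j] = pos
    return pooled, indices


def max_unpool(pooled: Grid, indices: IndexGrid, out_shape: Tuple[int, int]) -> Grid:
    """Put each pooled value back at its recorded position; other cells stay zero."""
    out = [[0.0] * out_shape[1] for _ in range(out_shape[0])]
    for i, row in enumerate(pooled):
        for j, value in enumerate(row):
            r, c = indices[i][j]
            out[r][c] = value
    return out


# Hypothetical 4 x 4 feature map used only for this demonstration.
feature_map = [
    [1.0, 3.0, 0.0, 2.0],
    [4.0, 2.0, 1.0, 0.0],
    [0.0, 1.0, 5.0, 6.0],
    [2.0, 0.0, 7.0, 1.0],
]
pooled, indices = max_pool_with_indices(feature_map)
restored = max_unpool(pooled, indices, (4, 4))
```

The restored map is sparse: each maximum returns to the exact pixel it came from, which is why this kind of upsampling can keep boundaries sharp. In an encoder-decoder network of the kind described, learned decoder filters would then fill in the zeros.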


WU Zongsheng, FU Weiping, HAN Gaining. Road scene understanding based on deep convolutional neural network[J]. Computer Engineering and Applications, 2017, 53(22): 8-15.
