Computer Engineering and Applications ›› 2020, Vol. 56 ›› Issue (13): 143-149.DOI: 10.3778/j.issn.1002-8331.1903-0207

Previous Articles     Next Articles

Monocular Depth Estimation Based on Convolutional Neural Network

WANG Xinsheng, ZHANG Guiling   

  1. 1.School of Computer Science & Software Engineering, Tianjin Polytechnic University, Tianjin 300387, China
    2.Tianjin Independent Intelligent Technology and System Key Laboratory, Tianjin 300387, China
  • Online:2020-07-01 Published:2020-07-02



  1. 1.天津工业大学 计算机科学与技术学院,天津 300387
    2.天津市自主智能技术与系统重点实验室,天津 300387


In order to use the convolutional neural network to learn to estimate the depth of a street image, the method of semantic segmentation is proposed to solve the problem of boundary blurring of the depth map. The estimated depth is generated by the left and right perspective maps to generate disparity maps for unsupervised training. The semantic segmentation layer is added in the network model, and multiple atrous convolutions parallel structure is adopted to increase the receptive field, the number of image down sampling is reduced, the information loss during down sampling is decreased, and the result is more accurate. This is also the first time in depth estimation combined with cavity convolution to increase the accuracy. By training the KITTI street dataset, in addition to increasing the detection accuracy and reducing the error rate, the objects in the renderings are made clearer, and some of the original models are retained in the renderings, it ignores the details and shows the original image more completely.

Key words: depth estimation, convolutional neural network, semantic segmentation, self-driving



关键词: 深度估计, 卷积神经网络, 语义分割, 无人驾驶