Monocular Depth Estimation in Outdoor Scene with Generative Adversarial Network

doi:10.3778/j.issn.1002-8331.2001-0019

Abstract

Generative Adversarial Network (GAN) algorithms achieve low accuracy on depth estimation in outdoor scenes and judge object boundaries inaccurately. To address this problem, this paper proposes a monocular depth estimation algorithm based on the Cycle-Consistent Generative Adversarial Network (CycleGAN). The algorithm splits the mapping from a single image to a depth image into two sub-stages. In the first stage, the network learns the basic spatial features of the image to obtain a coarse-scale depth map. Building on that result, the second stage refines the depth map by comparing differences in detail, yielding a fine-scale depth map. To further improve the accuracy of depth estimation, the L1 distance is introduced into the loss function so that the network can learn the pixel-to-pixel mapping and avoid large deviations and distortions. Experimental results on the public outdoor scene dataset Make3D show that, compared with similar algorithms, this algorithm achieves better results in average relative error and root mean square error.

Key words: depth estimation, Generative Adversarial Network (GAN), image conversion, semi-supervised learning, deep learning
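
The abstract names an L1 term in the loss function and two evaluation metrics but gives no formulas. The following is a minimal sketch of the standard definitions, not the authors' implementation. It assumes depth maps flattened to lists of positive values; the function names, the weight `lam`, and the sample numbers are all hypothetical and chosen only for illustration.

```python
import math


def l1_distance(pred, target):
    """Mean absolute difference between predicted and ground-truth depths."""
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target must be non-empty and equally long")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def generator_loss(adv_loss, pred, target, lam=10.0):
    """Adversarial term plus a weighted L1 term.

    `lam` is a hypothetical weight; the abstract does not state a value.
    """
    return adv_loss + lam * l1_distance(pred, target)


def mean_relative_error(pred, target):
    """Average of |pred - target| / target (target depths must be > 0)."""
    return sum(abs(p - t) / t for p, t in zip(pred, target)) / len(pred)


def rmse(pred, target):
    """Root mean square error between predicted and ground-truth depths."""
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred))


# Made-up depth values in metres, for illustration only.
pred = [2.0, 4.0, 10.0]
target = [2.0, 5.0, 8.0]

print(l1_distance(pred, target))
print(generator_loss(0.5, pred, target))
print(mean_relative_error(pred, target))
print(rmse(pred, target))
```

The L1 term penalizes every pixel's deviation directly, which is why it discourages large per-pixel errors that an adversarial term alone may tolerate.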

ZOU Chengming, HU Youpu. Monocular Depth Estimation in Outdoor Scene with Generative Adversarial Network[J]. Computer Engineering and Applications, 2021, 57(6): 176-183.
