引入生成对抗网络的室外场景单目深度估计

doi:10.3778/j.issn.1002-8331.2001-0019

计算机工程与应用 ›› 2021, Vol. 57 ›› Issue (6): 176-183.DOI: 10.3778/j.issn.1002-8331.2001-0019

引入生成对抗网络的室外场景单目深度估计

邹承明，胡佑璞

1.武汉理工大学计算机科学与技术学院，武汉 430000
2.交通物联网技术湖北省重点实验室，武汉 430000
3.鹏城实验室，广东深圳 518055

出版日期:2021-03-15 发布日期:2021-03-12

Monocular Depth Estimation in Outdoor Scene with Generative Adversarial Network

ZOU Chengming, HU Youpu

1.School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430000, China
2.Hubei Key Laboratory of Transportation Internet of Things, Wuhan 430000, China
3.Peng Cheng Laboratory, Shenzhen, Guangdong 518055, China

Online:2021-03-15 Published:2021-03-12

摘要/Abstract

摘要：

生成对抗网络（GAN）算法在室外场景的深度估计任务中准确率较低，对于物体边界判断不准确。针对该问题，提出基于循环生成对抗网络（CycleGAN）的单目深度估计算法，将单幅图像映射到深度图像的过程拆分为两个子阶段。第一阶段中，网络学习图像的基本空间特征，得到粗糙尺度下的深度图像；第二阶段在前者的基础上，通过细节上的差异对比，优化深度图像，得到精细尺度下的深度图像。为了进一步提高深度估计的精度，在损失函数中引入了L1距离，让网络可以学习像素到像素的映射关系，避免出现较大的偏差与失真。在公开的室外场景数据集Make3D上的实验结果表明，与同类型算法相比，该算法的平均相对误差、均方根误差取得更好的效果。

关键词: 深度估计, 生成对抗网络, 图像转换, 半监督学习, 深度学习

Abstract:

The Generative Adversarial Network（GAN） has a low accuracy rate in the depth estimation task in outdoor scenes, it is inaccurate for object boundary judgment. Focusing on this problem, this paper proposes a monocular depth estimation algorithm based on Cycle Generation Adversarial Network（CycleGAN）. The algorithm splits the process of mapping a single image to a depth image into two sub-stages. In the first stage, the network learns the basic spatial characteristics of the image to obtain a depthmap at a coarse scale. On the basis of the former, the second stage optimizes the depthmap by comparing the differences in details to obtain a depthmap at a fine scale. In order to further improve the accuracy of depth estimation, the L1 distance is introduced into the loss function, so that the network can learn the pixel-to-pixel mapping relationship and avoid large deviations and distortions. Experimental results on the public outdoor scene dataset Make3D show that, compared with similar algorithms, this algorithm achieve better results in average relative error and root mean square error.

Key words: depth estimation, Generative Adversarial Network（GAN）, image conversion, semi-supervised learning, deep learning

邹承明，胡佑璞. 引入生成对抗网络的室外场景单目深度估计[J]. 计算机工程与应用, 2021, 57(6): 176-183.

ZOU Chengming, HU Youpu. Monocular Depth Estimation in Outdoor Scene with Generative Adversarial Network[J]. Computer Engineering and Applications, 2021, 57(6): 176-183.

[1]	张波，徐黎明，黄志伟，要小鹏. 梯度策略的多目标GANs帕累托最优解算法[J]. 计算机工程与应用, 2021, 57(9): 89-95.
[2]	黄冬宜，杨兵，吴子豪，匡佳一，颜泽明. 用于全市蜂窝流量预测的时空全连接卷积网络[J]. 计算机工程与应用, 2021, 57(9): 168-175.
[3]	周伦钢，孙怡峰，王坤，吴疆，黄维贵，李炳龙. 目标多种多值属性的端端快速识别网络[J]. 计算机工程与应用, 2021, 57(9): 182-190.
[4]	柴旭，方明，付飞蚺，邵桢. 考场环境下考生视线估计方法[J]. 计算机工程与应用, 2021, 57(9): 199-206.
[5]	张成，戴俊峰，熊闻心. 融合LeNet-5改进的扫描文档手写日期识别[J]. 计算机工程与应用, 2021, 57(9): 207-211.
[6]	吴文龙，周喜，王轶，王保全. WKAG：一种针对不平衡医保数据的欺诈检测方法[J]. 计算机工程与应用, 2021, 57(9): 247-254.
[7]	武文杰，宋文爱，高雪梅，杨吉江，王青，黄丽萍，雷毅. 基于X线的成人OSA计算机辅助诊断综述[J]. 计算机工程与应用, 2021, 57(9): 1-8.
[8]	冉蓉，徐兴华，邱少华，崔小鹏，欧阳斌. 基于深度卷积神经网络的裂纹检测方法综述[J]. 计算机工程与应用, 2021, 57(9): 23-35.
[9]	李晓筱，胡晓光，王梓强，杜卓群. 基于深度学习的实例分割研究进展[J]. 计算机工程与应用, 2021, 57(9): 60-67.
[10]	李明山，韩清鹏，张天宇，王道累. 改进SSD的安全帽检测方法[J]. 计算机工程与应用, 2021, 57(8): 192-197.
[11]	曾春艳，严康，王志锋，余琰，纪纯妹. 深度学习模型可解释性研究综述[J]. 计算机工程与应用, 2021, 57(8): 1-9.
[12]	许德刚，王露，李凡. 深度学习的典型目标检测算法研究综述[J]. 计算机工程与应用, 2021, 57(8): 10-25.
[13]	王晋宇，杨海涛，李高源，张长弓，冯博迪. 生成对抗网络及其图像处理应用研究进展[J]. 计算机工程与应用, 2021, 57(8): 26-35.
[14]	蒋斌，钟瑞，张秋闻，张焕龙. 采用深度学习方法的非正面表情识别综述[J]. 计算机工程与应用, 2021, 57(8): 48-61.
[15]	赵圆丽，梁志剑. 基于异核卷积双注意机制的立场检测研究[J]. 计算机工程与应用, 2021, 57(8): 119-125.

引入生成对抗网络的室外场景单目深度估计

Monocular Depth Estimation in Outdoor Scene with Generative Adversarial Network

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics