双路融合的深度估计神经网络方法研究

doi:10.3778/j.issn.1002-8331.1908-0002

计算机工程与应用 ›› 2020, Vol. 56 ›› Issue (20): 138-145.DOI: 10.3778/j.issn.1002-8331.1908-0002

双路融合的深度估计神经网络方法研究

刘春，吴一珩

湖北工业大学计算机学院，武汉 430068

出版日期:2020-10-15 发布日期:2020-10-13

Research on Neural Network Method for Depth Estimation Based on Two-Way Fusion

LIU Chun, WU Yiheng

School of Computer Science, Hubei University of Technology, Wuhan 430068, China

Online:2020-10-15 Published:2020-10-13

摘要/Abstract

摘要：

从单目视觉中恢复深度信息是计算机视觉领域的经典问题，结合传统算法的深度学习方法是近年来的研究热点，但在神经网络的算法融合、参照物标定和应用场景上还有限制。提出了一种双路融合深度估计神经网络结构，分别基于深度与深度梯度的语义信息进行网络训练，对特征融合后再次训练得到最终的细节特征，并通过单次标定的方法解决真实参照物标定工作量大的问题。该网络结构能根据单张RGB图片推测出富有细节的深度信息，网络模型基于KITTI的深度图数据集训练，实验包括KITTI测试集和部分实际场景图集，结果表明该方法在深度信息细节的重建上优于对比深度估计方案，在大视场场景下的鲁棒性优良。

关键词: 深度估计, 单目视觉, 人工智能, 神经网络

Abstract:

Recovering depth information from monocular vision is a classical problem in the field of computer vision. The deep learning method combined with traditional algorithms is a hot research topic in recent years, but there are still limitations in algorithm fusion of neural networks, reference calibration and application scenarios. This paper proposes a two-way fusion depth estimation neural network structure, which trains the network based on the semantic information of depth gradient and depth gradient respectively, then trains the final detailed features again after feature fusion, and solves the problem of heavy workload of real reference calibration by single calibration method. The network structure can infer detailed depth information from a single RGB image. The network model is based on KITTI depth map data set training. Experiments include KITTI test set and some actual scene atlas. The results show that the method is superior to the contrast depth estimation scheme in depth information detail reconstruction and in large field of view scenario with good robustness.

Key words: depth estimation, monocular vision, artificial intelligence, neural network

刘春，吴一珩. 双路融合的深度估计神经网络方法研究[J]. 计算机工程与应用, 2020, 56(20): 138-145.

LIU Chun, WU Yiheng. Research on Neural Network Method for Depth Estimation Based on Two-Way Fusion[J]. Computer Engineering and Applications, 2020, 56(20): 138-145.

[1]	陶林娟, 华庚兴, 李波. 基于位置增强词向量和GRU-CNN的方面级情感分析模型研究[J]. 计算机工程与应用, 2024, 60(9): 212-218.
[2]	廉露, 田启川, 谭润, 张晓行. 基于神经网络的图像风格迁移研究进展[J]. 计算机工程与应用, 2024, 60(9): 30-47.
[3]	张俊三, 肖森, 高慧, 邵明文, 张培颖, 朱杰. 基于邻域采样的多任务图推荐算法[J]. 计算机工程与应用, 2024, 60(9): 172-180.
[4]	许智宏, 张天润, 王利琴, 董永峰. 融合图谱重构的时序知识图谱推理[J]. 计算机工程与应用, 2024, 60(9): 181-187.
[5]	宋建平, 王毅, 孙开伟, 刘期烈. 结合双曲图注意力网络与标签信息的短文本分类方法[J]. 计算机工程与应用, 2024, 60(9): 188-195.
[6]	杨文涛, 雷雨琦, 李星月, 郑天成. 融合汉字输入法的BERT与BLCG的长文本分类研究[J]. 计算机工程与应用, 2024, 60(9): 196-202.
[7]	邓希泉, 陈刚. ConvUCaps：基于卷积胶囊网络的医学图像分割模型[J]. 计算机工程与应用, 2024, 60(8): 258-266.
[8]	王永贵, 王芯茹. 融合自注意力和图卷积的多视图群组推荐[J]. 计算机工程与应用, 2024, 60(8): 287-295.
[9]	钱平, 韩睿, 谢凌东, 罗旺, 徐华荣, 李松松, 郑振东. 支持抑制型脉冲神经网络的硬件加速器[J]. 计算机工程与应用, 2024, 60(8): 338-347.
[10]	孙石磊, 李明, 刘静, 马金刚, 陈天真. 深度学习在糖尿病视网膜病变分类领域的研究进展[J]. 计算机工程与应用, 2024, 60(8): 16-30.
[11]	汪维泰, 王晓强, 李雷孝, 陶乙豪, 林浩. 时空图神经网络在交通流预测研究中的构建与应用综述[J]. 计算机工程与应用, 2024, 60(8): 31-45.
[12]	谢威宇, 张强. 基于深度学习的图像中无人机与飞鸟检测研究综述[J]. 计算机工程与应用, 2024, 60(8): 46-55.
[13]	宋世林, 张学军. 脑电信号多特征融合与卷积神经网络算法研究[J]. 计算机工程与应用, 2024, 60(8): 148-155.
[14]	姜良, 张程, 魏德健, 曹慧, 杜昱峥. 深度学习在骨质疏松辅助诊断中的应用[J]. 计算机工程与应用, 2024, 60(7): 26-40.
[15]	郑小丽, 王巍, 杜雨晅, 张闯. 面向会话的需求感知注意图神经网络推荐模型[J]. 计算机工程与应用, 2024, 60(7): 128-140.

双路融合的深度估计神经网络方法研究

Research on Neural Network Method for Depth Estimation Based on Two-Way Fusion

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics