Improved YOLO V2 6D Object Pose Estimation Algorithm

doi:10.3778/j.issn.1002-8331.2001-0367

Abstract

Abstract:

For the 3D pose estimation of the target, combined with the target detection model based on deep learning, 6D target pose estimation algorithm based on improved YOLO V2 is proposed. The feature information of an object in an RGB image is extracted by a convolutional neural network. Based on 2D detection, target position information is mapped to the three-dimensional space. The point-to-point mapping relationship is used to match and calculate target freedom degree in three dimensions. Then, target 6D pose is estimated. The algorithm detects a target in an RGB image. At the same time, target 6D attitude is predicted, which does not require additional post-processing. Experimental results show that the proposed algorithm performs better on other LineMod and Occlusion LineMod datasets than other CNN-based methods recently proposed. The proposed algorithm runs at 37?frames per second on Titan X GPU and can be processed in real time.

Key words: pose estimation, object detection, convolutional neural network, feature extraction

摘要：

针对目标的三维姿态估计，结合基于深度学习的目标检测模型，提出一种基于改进YOLO V2的6D目标姿态估计算法。通过卷积神经网络提取一幅RGB图像中目标的特征信息；在2D检测的基础上将目标的位置信息映射到三维空间；利用点到点的映射关系在三维空间匹配并计算目标的自由度，进而估计目标的6D姿态。该算法不仅能检测单幅RGB图像中的目标，还可以预测目标的6D姿态，同时不需要额外的后处理过程。实验表明，该算法在LineMod和Occlusion LineMod数据集上的性能优于最近提出的其他基于CNN的方法，在Titan X GPU上的运行速度是37?frame/s，适合实时处理。

关键词: 姿态估计, 目标检测, 卷积神经网络, 特征提取

BAO Zhiqiang, XING Yu, LYU Shaoqing, HUANG Qiongdan. Improved YOLO V2 6D Object Pose Estimation Algorithm[J]. Computer Engineering and Applications, 2021, 57(9): 148-153.

包志强，邢瑜，吕少卿，黄琼丹. 改进YOLO V2的6D目标姿态估计算法[J]. 计算机工程与应用, 2021, 57(9): 148-153.

[1]	RAN Rong, XU Xinghua, QIU Shaohua, CUI Xiaopeng, OUYANG Bin. Review of Crack Detection Methods Based on Deep Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(9): 23-35.
[2]	MOU Qingping, ZHANG Ying, ZHANG Dongbo, WANG Xinjie, YANG Zhiqiao. Research on Visual Tracking Algorithm and Application of Target Loss Discrimination Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 140-147.
[3]	HUANG Dongyi, YANG Bing, WU Zihao, KUANG Jiayi, YAN Zeming. Spatio-Temporal Fully Connected Convolutional Neural Networks for Citywide Cellular Prediction [J]. Computer Engineering and Applications, 2021, 57(9): 168-175.
[4]	ZHAO Zhiyan, YANG Hua, HU Zhiwei, YU Haiping. Identification Model of Pests on Yuluxiang Pear Leaves Based on TACNN [J]. Computer Engineering and Applications, 2021, 57(9): 176-181.
[5]	ZHOU Lungang, SUN Yifeng, WANG Kun, WU Jiang, HUANG Weigui, LI Binglong. End to End Object Recognition Algorithm for Multi-attributes of Multi-values [J]. Computer Engineering and Applications, 2021, 57(9): 182-190.
[6]	ZHANG Cheng, DAI Junfeng, XIONG Wenxin. Improved Handwritten Date Recognition in Scanned Documents Combined with LeNet-5 [J]. Computer Engineering and Applications, 2021, 57(9): 207-211.
[7]	WANG Bo, SONG Dan, WANG Hongyu. Research on Key Technologies of UAV Autonomous Inspection System [J]. Computer Engineering and Applications, 2021, 57(9): 255-263.
[8]	MA Zhexu, YANG Feng, QIAO Xu. Intelligent Detection Method of Railway Subgrade Defect [J]. Computer Engineering and Applications, 2021, 57(9): 272-278.
[9]	XU Degang, WANG Lu, LI Fan. Review of Typical Object Detection Algorithms for Deep Learning [J]. Computer Engineering and Applications, 2021, 57(8): 10-25.
[10]	LI Zhenxiao, SUN Wei, LIU Mingming, ZHENG Lili, CHEN Shaoying. Research on Vehicle Detection and Tracking Algorithms in Traffic Monitoring Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 103-111.
[11]	ZHANG Yue, HUANG Yourui, LIU Pengkun. Research on Multi-resolution Human Pose Estimation with Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(8): 126-132.
[12]	XU Shaojie, CAO Chuqing, WANG Yongjuan. Application Research of Visual SLAM in Indoor Dynamic Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 175-179.
[13]	DONG Peng, ZHOU Feng, ZHAO Congcong, WANG Yafei, MI Zetian, FU Xianping. Automatic Measurement of Underwater Sea Cucumber Size Based on Binocular Vision [J]. Computer Engineering and Applications, 2021, 57(8): 271-278.
[14]	LIANG Fangxuan, YANG Feng, LU Liyun, YIN Mengxiao. Review of Brain Tumor Segmentation Methods Based on Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(7): 34-43.
[15]	YANG Peiwei, ZHOU Yuhong, XING Gang, TIAN Zhiqiang, XU Xiayu. Applications of Convolutional Neural Network in Biomedical Image [J]. Computer Engineering and Applications, 2021, 57(7): 44-58.

Improved YOLO V2 6D Object Pose Estimation Algorithm

改进YOLO V2的6D目标姿态估计算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics