Robot Object Detection and Localization Based on Deep Learning

doi:10.3778/j.issn.1002-8331.1912-0328

Abstract

Abstract:

Tiny-YOLOV3 is a common detection algorithm in the field of object detection. Compared with YOLOV3, tiny has the advantages of simpler neural network layer, less computation and lower hardware configuration requirements, so it can ensure the real-time detection. However, due to the lack of network layer, the accuracy of tiny detection is also low. In order to improve the detection accuracy of Tiny-YOLOV3, this paper improves the network structure of Tiny-YOLOV3 by adjusting the loss structure layer of the detection network architecture, and uses the correlation coefficient matrix of the convolution layer and the characteristic graph to represent the distribution of the characteristic graph. In addition, this paper also designs the loss function to optimize the distribution of the loss feature layer and enhances the expression ability of the network features. Finally, combining with the NAO robot, the object detection position of the image is transformed into the robot coordinate system position through the trigonometric function positioning. Based on 4,000 self-made image data set training and testing, this paper designs object detection and grab experiment on the NAO robot under various scene. For 50 times experiments with a detection speed at 35 frames per second（same as the original）, the optimized model improves the map value by 4.08%, the confidence level by 20%, and the touch success rate by 10%. The results show that it meets the requirement of real time object detection and improves the detection accuracy at the same time, which will help a lot in real-time application scenarios such as robot sorting, picking, monitoring, service, etc.

Key words: object detection, robot, network structure, Tiny-YOLOV3, loss function

摘要：

Tiny-YOLOV3是目标检测领域常用的检测算法，相比较YOLOV3，其优点是神经网络层比较简单，计算量少，且对硬件的配置要求较低，因此可以保证检测的实时性，但由于网络层比较少，检测的精度也较低。为了提高Tiny-YOLOV3在网络中的检测精度，提出一类Tiny-YOLOV3改进模型，调整检测网络架构的损失结构层，以卷积层和特征图的相关系数矩阵表征特征图分布，设计损失函数优化损失特征层分布，增强网络特征的表达能力。结合NAO机器人平台，采用三角函数定位将基于图像的目标检测位置转换为机器人坐标系位置。根据4 000张VOC数据格式自制数据集进行模型训练与测试，针对不同物体在变化位置下进行50次机器人手臂抓取实验。相比原始Tiny-YOLOV3模型，改进的网络模型在分辨率为640×480单张图片的检测速度35 帧/s前提下，检测mAP值提高了4.08%，置信度提高20%。实验结果表明算法在兼顾目标检测时间效率的前提下有效提高了目标检测准确度，可满足机器人在分拣、采摘、监控、服务等多样实时性应用场景需求。

关键词: 目标检测, 机器人, 网络结构, Tiny-YOLOV3, 损失函数

HUANG Yimeng, YI Yang. Robot Object Detection and Localization Based on Deep Learning[J]. Computer Engineering and Applications, 2020, 56(24): 181-187.

黄怡蒙，易阳. 融合深度学习的机器人目标检测与定位[J]. 计算机工程与应用, 2020, 56(24): 181-187.

[1]	BAO Zhiqiang, XING Yu, LYU Shaoqing, HUANG Qiongdan. Improved YOLO V2 6D Object Pose Estimation Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 148-153.
[2]	ZHOU Lungang, SUN Yifeng, WANG Kun, WU Jiang, HUANG Weigui, LI Binglong. End to End Object Recognition Algorithm for Multi-attributes of Multi-values [J]. Computer Engineering and Applications, 2021, 57(9): 182-190.
[3]	WANG Bo, SONG Dan, WANG Hongyu. Research on Key Technologies of UAV Autonomous Inspection System [J]. Computer Engineering and Applications, 2021, 57(9): 255-263.
[4]	XU Degang, WANG Lu, LI Fan. Review of Typical Object Detection Algorithms for Deep Learning [J]. Computer Engineering and Applications, 2021, 57(8): 10-25.
[5]	LI Zhenxiao, SUN Wei, LIU Mingming, ZHENG Lili, CHEN Shaoying. Research on Vehicle Detection and Tracking Algorithms in Traffic Monitoring Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 103-111.
[6]	XU Shaojie, CAO Chuqing, WANG Yongjuan. Application Research of Visual SLAM in Indoor Dynamic Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 175-179.
[7]	GUO Xiaojing, SUI Haoda. Application of Improved YOLOv3 in Foreign Object Debris Target Detection on Airfield Pavement [J]. Computer Engineering and Applications, 2021, 57(8): 249-255.
[8]	DONG Peng, ZHOU Feng, ZHAO Congcong, WANG Yafei, MI Zetian, FU Xianping. Automatic Measurement of Underwater Sea Cucumber Size Based on Binocular Vision [J]. Computer Engineering and Applications, 2021, 57(8): 271-278.
[9]	LIAO Liefa, LI Haohan, LI Shuai, ZHU Helong, LI Zhijun. Research on Control Strategy of Soccer Robot Combined with Winner-Take-All [J]. Computer Engineering and Applications, 2021, 57(7): 136-143.
[10]	LIU Ziyan, YUAN Lei, ZHU Mingcheng, MA Shanshan, CHEN Linzhouting. YOLOv3 Traffic sign Detection based on SPP and Improved FPN [J]. Computer Engineering and Applications, 2021, 57(7): 164-170.
[11]	YUN Xu, SONG Huansheng, LIANG Haoxiang, HOU Jingyan, DAI Zhe. Personnel Behavior Analysis System for Key Positions Based on Deep Learning [J]. Computer Engineering and Applications, 2021, 57(6): 225-231.
[12]	ZHOU Youhang, ZHAO Hanyun, LIU Hanjiang, LI Yuze, XIAO Yuqin. Self-Learning Gait Planning Method for Biped Robot Using DDPG [J]. Computer Engineering and Applications, 2021, 57(6): 254-259.
[13]	XIAO Yuqing, YANG Huimin. Research on Application of Object Detection Algorithm in Traffic Scene [J]. Computer Engineering and Applications, 2021, 57(6): 30-41.
[14]	WANG Di, LI Caihong, GUO Na, LIU Guoming, GAO Tengteng. Local Path Planning of Mobile Robot Based on Fuzzy Potential Field Method [J]. Computer Engineering and Applications, 2021, 57(6): 212-218.
[15]	JIANG Lin, FANG Dongjun, LEI Bin, LI Weigang. Research Status and Trend of Navigation Algorithm for Mobile Robot with Monocular Vision [J]. Computer Engineering and Applications, 2021, 57(5): 1-9.

Robot Object Detection and Localization Based on Deep Learning

融合深度学习的机器人目标检测与定位

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics