融合深度学习的机器人目标检测与定位

doi:10.3778/j.issn.1002-8331.1912-0328

摘要/Abstract

摘要：

Tiny-YOLOV3是目标检测领域常用的检测算法，相比较YOLOV3，其优点是神经网络层比较简单，计算量少，且对硬件的配置要求较低，因此可以保证检测的实时性，但由于网络层比较少，检测的精度也较低。为了提高Tiny-YOLOV3在网络中的检测精度，提出一类Tiny-YOLOV3改进模型，调整检测网络架构的损失结构层，以卷积层和特征图的相关系数矩阵表征特征图分布，设计损失函数优化损失特征层分布，增强网络特征的表达能力。结合NAO机器人平台，采用三角函数定位将基于图像的目标检测位置转换为机器人坐标系位置。根据4 000张VOC数据格式自制数据集进行模型训练与测试，针对不同物体在变化位置下进行50次机器人手臂抓取实验。相比原始Tiny-YOLOV3模型，改进的网络模型在分辨率为640×480单张图片的检测速度35 帧/s前提下，检测mAP值提高了4.08%，置信度提高20%。实验结果表明算法在兼顾目标检测时间效率的前提下有效提高了目标检测准确度，可满足机器人在分拣、采摘、监控、服务等多样实时性应用场景需求。

关键词: 目标检测, 机器人, 网络结构, Tiny-YOLOV3, 损失函数

Abstract:

Tiny-YOLOV3 is a common detection algorithm in the field of object detection. Compared with YOLOV3, tiny has the advantages of simpler neural network layer, less computation and lower hardware configuration requirements, so it can ensure the real-time detection. However, due to the lack of network layer, the accuracy of tiny detection is also low. In order to improve the detection accuracy of Tiny-YOLOV3, this paper improves the network structure of Tiny-YOLOV3 by adjusting the loss structure layer of the detection network architecture, and uses the correlation coefficient matrix of the convolution layer and the characteristic graph to represent the distribution of the characteristic graph. In addition, this paper also designs the loss function to optimize the distribution of the loss feature layer and enhances the expression ability of the network features. Finally, combining with the NAO robot, the object detection position of the image is transformed into the robot coordinate system position through the trigonometric function positioning. Based on 4,000 self-made image data set training and testing, this paper designs object detection and grab experiment on the NAO robot under various scene. For 50 times experiments with a detection speed at 35 frames per second（same as the original）, the optimized model improves the map value by 4.08%, the confidence level by 20%, and the touch success rate by 10%. The results show that it meets the requirement of real time object detection and improves the detection accuracy at the same time, which will help a lot in real-time application scenarios such as robot sorting, picking, monitoring, service, etc.

Key words: object detection, robot, network structure, Tiny-YOLOV3, loss function

黄怡蒙，易阳. 融合深度学习的机器人目标检测与定位[J]. 计算机工程与应用, 2020, 56(24): 181-187.

HUANG Yimeng, YI Yang. Robot Object Detection and Localization Based on Deep Learning[J]. Computer Engineering and Applications, 2020, 56(24): 181-187.

[1]	包志强，邢瑜，吕少卿，黄琼丹. 改进YOLO V2的6D目标姿态估计算法[J]. 计算机工程与应用, 2021, 57(9): 148-153.
[2]	周伦钢，孙怡峰，王坤，吴疆，黄维贵，李炳龙. 目标多种多值属性的端端快速识别网络[J]. 计算机工程与应用, 2021, 57(9): 182-190.
[3]	张朕通，单玉刚，袁杰. 联合多尺度和注意力机制的遥感影像检测[J]. 计算机工程与应用, 2021, 57(9): 212-216.
[4]	王博，宋丹，王洪玉. 无人机自主巡检系统的关键技术研究[J]. 计算机工程与应用, 2021, 57(9): 255-263.
[5]	许德刚，王露，李凡. 深度学习的典型目标检测算法研究综述[J]. 计算机工程与应用, 2021, 57(8): 10-25.
[6]	李震霄，孙伟，刘明明，郑丽丽，陈劭颖. 交通监控场景中的车辆检测与跟踪算法研究[J]. 计算机工程与应用, 2021, 57(8): 103-111.
[7]	董旭彬，赵清华. 改进Mask R-CNN在航空影像目标检测的研究应用[J]. 计算机工程与应用, 2021, 57(8): 133-144.
[8]	徐少杰，曹雏清，王永娟. 视觉SLAM在室内动态场景中的应用研究[J]. 计算机工程与应用, 2021, 57(8): 175-179.
[9]	郭晓静，隋昊达. 改进YOLOv3在机场跑道异物目标检测中的应用[J]. 计算机工程与应用, 2021, 57(8): 249-255.
[10]	董鹏，周烽，赵悰悰，王亚飞，米泽田，付先平. 基于双目视觉的水下海参尺寸自动测量方法[J]. 计算机工程与应用, 2021, 57(8): 271-278.
[11]	马巧梅，王明俊，梁昊然. 复杂场景下基于改进YOLOv3的车牌定位检测算法[J]. 计算机工程与应用, 2021, 57(7): 198-208.
[12]	侯旋，薛飞，陈涛. 无人机目标检测量子多模式识别优化算法[J]. 计算机工程与应用, 2021, 57(7): 228-236.
[13]	沈新烽，姜平，周根荣. 改进SSD算法在零部件检测中的应用研究[J]. 计算机工程与应用, 2021, 57(7): 257-262.
[14]	廖列法，李浩瀚，李帅，朱合隆，李志军. 结合Winner-Take-All的足球机器人控制策略研究[J]. 计算机工程与应用, 2021, 57(7): 136-143.
[15]	刘紫燕，袁磊，朱明成，马珊珊，陈霖周廷. 融合SPP和改进FPN的YOLOv3交通标志检测[J]. 计算机工程与应用, 2021, 57(7): 164-170.

融合深度学习的机器人目标检测与定位

Robot Object Detection and Localization Based on Deep Learning

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics