计算机工程与应用 ›› 2020, Vol. 56 ›› Issue (17): 156-160.DOI: 10.3778/j.issn.1002-8331.1907-0044

• 模式识别与人工智能 • 上一篇    下一篇

深度强化学习在室内无人机目标搜索中的应用

赖俊,饶瑞   

  1. 陆军工程大学 指挥控制工程学院,南京 210007
  • 出版日期:2020-09-01 发布日期:2020-08-31

Application of Deep Reinforcement Learning in Indoor UAV Target Search

LAI Jun, RAO Rui   

  1. College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China
  • Online:2020-09-01 Published:2020-08-31

摘要:

针对室内无人机随机目标搜索效率不高、准确率低等问题,提出了一种基于空间位置标注的好奇心驱动的深度强化学习方法。用正六边形对探索空间进行区域划分,并标记无人机在各区域的访问次数,将其作为好奇心,产生内部奖励,以鼓励无人机不断探索新领域,有效避免其陷入到局部区域;训练时采用近端策略优化算法(PPO)优化神经网络参数,该算法能使无人机更快找到最优搜索策略,较好躲避障碍物,有效缩短训练周期,提升搜索效率和准确率。

关键词: 深度强化学习, 室内搜索, 好奇心

Abstract:

Inview of the low efficiency and low accuracy of indoor random target search by UAV, this paper proposes the deep reinforcement learning algorithm by curiosity-driven exploration based on spatial location annotation. Firstly, it divides the exploration space by regular hexagon, and records the number of the UAV’s visiting in each single area. Then, it generates the internal rewards by the visiting records, which can encourage the UAV to explore new areas continuously and effectively avoid LUAV sinking into local areas. When training the neural network, it uses PPO(Proximal Policy Optimization) algorithm to optimize the parameters, which can find the optimal search strategy faster, avoid the obstacles better, shorten the training period, and improve the search efficiency and accuracy.

Key words: deep reinforcement learning, indoor search, curiosity