Computer Engineering and Applications ›› 2020, Vol. 56 ›› Issue (17): 156-160.DOI: 10.3778/j.issn.1002-8331.1907-0044

Previous Articles     Next Articles

Application of Deep Reinforcement Learning in Indoor UAV Target Search

LAI Jun, RAO Rui   

  1. College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China
  • Online:2020-09-01 Published:2020-08-31



  1. 陆军工程大学 指挥控制工程学院,南京 210007


Inview of the low efficiency and low accuracy of indoor random target search by UAV, this paper proposes the deep reinforcement learning algorithm by curiosity-driven exploration based on spatial location annotation. Firstly, it divides the exploration space by regular hexagon, and records the number of the UAV’s visiting in each single area. Then, it generates the internal rewards by the visiting records, which can encourage the UAV to explore new areas continuously and effectively avoid LUAV sinking into local areas. When training the neural network, it uses PPO(Proximal Policy Optimization) algorithm to optimize the parameters, which can find the optimal search strategy faster, avoid the obstacles better, shorten the training period, and improve the search efficiency and accuracy.

Key words: deep reinforcement learning, indoor search, curiosity



关键词: 深度强化学习, 室内搜索, 好奇心