Computer Engineering and Applications ›› 2013, Vol. 49 ›› Issue (8): 257-260.

Previous Articles     Next Articles

Application of action prediction in multi-robot reinforcement learning cooperation

CAO Jie, ZHU Ningning   

  1. College of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China
  • Online:2013-04-15 Published:2013-04-15

动作预测在多机器人强化学习协作中的应用

曹  洁,朱宁宁   

  1. 兰州理工大学 计算机与通信学院,兰州 730050

Abstract: In multi-robot systems, the spatial scale of reinforcement learning of the cooperation environment exploration is made up of the exponential function of the number of robots. And the enormous learning space results in the slow convergence rate. To solve this problem, a prediction-based reinforcement learning algorithm and the action selection strategy are applied to the research on multi-robot cooperation. By predicting the probability of actions that other robots may execute, the convergence rate of this algorithm is accelerated. The experimental results show that reinforcement learning algorithm based-on action prediction can achieve the multi-robot’s cooperation strategy much faster, compared to the primitive algorithm.

Key words: action prediction, reinforcement learning, multi-robot cooperation

摘要: 在多机器人系统中,协作环境探索的强化学习的空间规模是机器人个数的指数函数,学习空间非常庞大造成收敛速度极慢。为了解决这个问题,将基于动作预测的强化学习方法及动作选择策略应用于多机器人协作研究中,通过预测机器人可能执行动作的概率以加快学习算法的收敛速度。实验结果表明,基于动作预测的强化学习方法能够比原始算法更快速地获取多机器人的协作策略。

关键词: 动作预测, 强化学习, 多机器人协作