计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (19): 39-40.

• 理论研究 • 上一篇    下一篇

基于ACCA的Option自动生成算法

胡明辉,殷苌茗,李立云   

  1. 长沙理工大学 计算机与通信工程学院,长沙 410076
  • 收稿日期:2007-09-27 修回日期:2007-12-13 出版日期:2008-07-01 发布日期:2008-07-01
  • 通讯作者: 胡明辉

Option automatic generation algorithm based on ACCA

HU Ming-hui,YIN Chang-ming,LI Li-yun

  

  1. College of Computer and Communicational Engineering,Changsha University of Science and Technology,Changsha 410076,China
  • Received:2007-09-27 Revised:2007-12-13 Online:2008-07-01 Published:2008-07-01
  • Contact: HU Ming-hui

摘要: 提出了一种新的分层强化学习(HRL)Option自动生成算法,以Agent在学习初始阶段探测到的状态空间为输入,并采用改进的蚁群聚类算法(ACCA)对其进行聚类,在聚类后的各状态子集上通过经验回放学习产生内部策略集,从而生成Option,仿真实验验证了该算法是有效的。

关键词: 分层强化学习, Option, 蚁群聚类算法, 经验回放

Abstract: A new algorithm for Option automatic generation of hierarchical reinforcement learning is presented.The algorithm takes the state space explored by Agent as input in the initial learning phase and clusters the states employing Ant Colony Clustering Algorithm(ACCA).Based on the clustered state sets,the intra-strategies are learned by an experience replay procedure.As a result,the Options are generated.The validity of the algorithm is demonstrated by simulation experiments.

Key words: hierarchical reinforcement learning, Option, Ant Colony Clustering Algorithm(ACCA), experience replay