计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (13): 1-5.

• 博士论坛 • 上一篇    下一篇

分层强化学习研究进展

程晓北,沈 晶,刘海波,顾国昌,张国印   

  1. 哈尔滨工程大学 计算机科学与技术学院,哈尔滨 150001
  • 收稿日期:2008-04-10 修回日期:2008-04-15 出版日期:2008-05-01 发布日期:2008-05-01
  • 通讯作者: 程晓北

Advances in hierarchical reinforcement learning

CHENG Xiao-bei,SHEN Jing,LIU Hai-bo,GU Guo-chang,ZHANG Guo-yin   

  1. School of Computer Science and Technology,Harbin Engineering University,Harbin 150001,China
  • Received:2008-04-10 Revised:2008-04-15 Online:2008-05-01 Published:2008-05-01
  • Contact: CHENG Xiao-bei

摘要: 强化学习通过试错与环境交互获得策略的改进,其自学习和在线学习的特点使其成为机器学习研究的一个重要分支。但强化学习方法一直被维数灾难所困扰。近年来,分层强化学习方法在解决维数灾问题中取得了显著成果,并逐渐开始向多智能体系统推广,论文归纳分析这一领域目前的研究进展,并对迫切需要解决的一些问题和进一步的发展趋势作出探讨和展望。

关键词: 分层强化学习, 多智能体系统, 维数灾难

Abstract: Reinforcement learning is an approach that an agent can learn its behaviors through trial-and-error interaction with a dynamic environment.It has been an important branch of machine learning for its self-learning and online learning capabilities. But reinforcement learning is bedeviled by the curse of dimensionality.Recently,Hierarchical Reinforcement Learning(HRL) has made great progresses to combat the curse of dimensionality.And the HRL approaches have been being applied to multi-agent system.The recent advances in HRL are surveyed in this paper.Then,some open problems are discussed.Finally,the HRL prospects are shown.

Key words: Hierarchical Reinforcement Learning, multi-agent system, curse of dimensionality