Nonparametric Approximation Policy Iteration Reinforcement Learning Based on CMAC
JI Ting, ZHANG Hua
Computer Engineering and Applications. 2019, (2): 128-136. DOI: 10.3778/j.issn.1002-8331.1709-0489