基于递推最小二乘法的多步时序差分学习算法
陈学松,杨宜民
Multi-step temporal difference learning algorithm based on recursive least-squares method
CHEN Xue-song,YANG Yi-min
计算机工程与应用 . 2010, (8): 52 -55 .  DOI: 10.3778/j.issn.1002-8331.2010.08.015