基于随机方差减小方法的DDPG算法
杨薛钰,陈建平,傅启明,陆悠,吴宏杰
Deep Deterministic Policy Gradient Algorithm Based on Stochastic Variance Reduction Method
YANG Xueyu, CHEN Jianping, FU Qiming, LU You, WU Hongjie
计算机工程与应用 . 2021, (19): 104 -111 .  DOI: 10.3778/j.issn.1002-8331.2009-0097