面向轨迹规划的深度强化学习奖励函数设计
李跃,邵振洲,赵振东,施智平,关永
Design of Reward Function in Deep Reinforcement Learning for Trajectory Planning
LI Yue, SHAO Zhenzhou, ZHAO Zhendong, SHI Zhiping, GUAN Yong
计算机工程与应用
.
2020, (2): 226
-232
.
DOI: 10.3778/j.issn.1002-8331.1810-0021