面向轨迹规划的深度强化学习奖励函数设计
李跃,邵振洲,赵振东,施智平,关永
Design of Reward Function in Deep Reinforcement Learning for Trajectory Planning
LI Yue, SHAO Zhenzhou, ZHAO Zhendong, SHI Zhiping, GUAN Yong
计算机工程与应用 . 2020, (2): 226 -232 .  DOI: 10.3778/j.issn.1002-8331.1810-0021