面向机械臂轨迹规划的强化学习奖励函数设计
靳栋银, 李跃, 邵振洲, 施智平, 关永
Design of Reinforcement Learning Reward Function for Trajectory Planning of Robot Manipulator
JIN Dongyin, LI Yue, SHAO Zhenzhou, SHI Zhiping, GUAN Yong
计算机工程与应用 . 2022, (19): 302 -308 .  DOI: 10.3778/j.issn.1002-8331.2102-0307