融合动作退出和软奖励的强化学习知识推理方法
孙崇, 王海荣, 荆博祥, 马赫
Knowledge Reasoning Method of Reinforcement Learning Integrating Action Withdrawal and Soft Reward
SUN Chong, WANG Hairong, JING Boxiang, MA He
计算机工程与应用 . 2024, (24): 158 -165 .  DOI: 10.3778/j.issn.1002-8331.2308-0215