Exploration Strategy in Reinforcement Learning Based on Intrinsic Reward for Recommendation
YU Yuanqing, MA Weizhi, ZHANG Min
Computer Engineering and Applications. 2025, (7): 188-195. DOI: 10.3778/j.issn.1002-8331.2311-0037