Application of Multi-armed Bandit Algorithm with Time-Varying Rewards in Dynamic Pricing
QIAO Xunshuang, BI Wenjie
Computer Engineering and Applications. 2021, (12): 237-242. DOI: 10.3778/j.issn.1002-8331.2003-0382