Computer Engineering and Applications ›› 2012, Vol. 48 ›› Issue (23): 44-46.

Previous Articles     Next Articles

Research on reinforcement learning negotiation strategy and its optimization

SUN Tianhao, DENG Junkun, CHEN Fei, ZHU Qingsheng   

  1. College of Computer Science, Chongqing University, Chongqing 400030, China
  • Online:2012-08-11 Published:2012-08-21

基于增强学习协商策略的研究及优化

孙天昊,邓俊昆,陈  飞,朱庆生   

  1. 重庆大学 计算机学院,重庆 400030

Abstract: Reinforcement learning can help negotiation agent to select its best actions and reach its final goal. Agent of traditional reinforcement learning negotiation strategy significantly compromises at the beginning of negotiation, which is irrational, loses touch with reality, and greatly reduces expectation of Agent. Expectation restoration rate is used to restore the true expectations of agent to optimize the negotiation strategy;the impact of value of expectation restoration rate on negotiation process is discussed; experimental results show that optimized negotiation strategy improves the quality of the negotiation result, while ensuring negotiation efficiency.

Key words: negotiation strategy, reinforcement learning, expectation restoration rate

摘要: 增强学习在电子商务中可以帮助Agent选择最优行动,并达成目标。在传统增强学习协商策略中,Agent一开始便进行大幅度的妥协,这是不合理的,与现实不符,降低了Agent的期望。通过期望还原率来还原Agent的真实期望,对协商策略进行优化;讨论了期望还原率的取值对协商过程的影响;通过实验验证了优化的协商策略在保证协商效率的同时,提高了协商解的质量。

关键词: 协商策略, 增强学习, 期望还原率