Markov reinforcement learning driven by utility
HAN Wei
Computer Engineering and Applications . 2009, (4): 42 -44 .  DOI: 10.3778/j.issn.1002-8331.2009.04.012