Policy gradient algorithm based on internal structural MPOMDP model
ZHANG Run-mei 1,2,WANG Hao 1,ZHANG You-sheng 1,YAO Hong-liang 1,FANG Chang-sheng 1
Computer Engineering and Applications . 2009, (7): 20 -23 .  DOI: 10.3778/j.issn.1002-8331.2009.07.007