Multi-step temporal difference learning algorithm based on recursive least-squares method
CHEN Xue-song,YANG Yi-min
1.Faculty of Applied Mathematics,Guangdong University of Technology,Guangzhou 510006,China 2.Faculty of Automation,Guangdong University of Technology,Guangzhou 510006,China