A Multi-Step Reinforcement Learning Algorithm

Abstract:

Reinforcement learning (RL) is a machine learning method based on state or action values that approximately solves large-scale Markov Decision Processes (MDPs) or Semi-Markov Decision Processes (SMDPs). A multi-step RL algorithm called Sarsa(λ,k) is proposed as a compromise between Sarsa and Sarsa(λ). It is equivalent to Sarsa when k is 1 and to Sarsa(λ) when k is infinite. Sarsa(λ,k) adjusts its performance through the choice of k. Two forms of Sarsa(λ,k), forward-view Sarsa(λ,k) and backward-view Sarsa(λ,k), are constructed and proved equivalent under off-line updating.
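
The abstract does not give the update rule itself, so the following is only a minimal sketch of one way a forward-view target with the stated limiting behaviour could look. It assumes the standard truncated λ-return, which mixes the 1-step to k-step Sarsa returns with geometric weights; the function names, the argument layout, and the convention that `q_values` has one more entry than `rewards` (with 0 for a terminal state) are illustrative assumptions, not taken from the paper.

```python
from typing import Sequence


def n_step_return(rewards: Sequence[float], q_values: Sequence[float],
                  t: int, n: int, gamma: float) -> float:
    """n-step Sarsa return from time t (hypothetical helper).

    rewards[i] is the reward received after the action at step i;
    q_values[i] is the current estimate Q(s_i, a_i).
    q_values must have len(rewards) + 1 entries (0.0 for a terminal state).
    """
    end = min(t + n, len(rewards))  # do not look past the end of the episode
    g = sum(gamma ** (i - t) * rewards[i] for i in range(t, end))
    return g + gamma ** (end - t) * q_values[end]  # bootstrap from Q


def truncated_lambda_return(rewards: Sequence[float], q_values: Sequence[float],
                            t: int, k: int, lam: float, gamma: float) -> float:
    """Assumed k-step truncated lambda-return (weights sum to 1)."""
    g = 0.0
    for n in range(1, k):
        g += (1.0 - lam) * lam ** (n - 1) * n_step_return(rewards, q_values, t, n, gamma)
    # All remaining weight is placed on the longest (k-step) return.
    g += lam ** (k - 1) * n_step_return(rewards, q_values, t, k, gamma)
    return g
```

Under this assumed definition, k = 1 reduces to the ordinary one-step Sarsa target, and letting k grow past the episode length recovers the usual episodic λ-return, which matches the two limiting cases described in the abstract.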

Info:

Periodical:

Applied Mechanics and Materials (Volumes 44-47)

Edited by:

Ran Chen

Pages:

3611-3615

DOI:

10.4028/www.scientific.net/AMM.44-47.3611

Citation:

Z. C. Zhang et al., "A Multi-Step Reinforcement Learning Algorithm", Applied Mechanics and Materials, Vols. 44-47, pp. 3611-3615, 2011

Online since:

December 2010
