Are RL methods stable with function approximation?

Article from www.avaye.com


Are RL methods stable with function approximation?

The situation is a bit complicated and in flux at present. Stability guarantees depend on the specific algorithm and function approximator, and on the way it is used. This is what we knew as of August 2001:

Since then, the new Perkins and Precup result from NIPS 2002 has appeared, which may have at last resolved the question positively by proving the convergence of Sarsa with linear function approximation and an appropriate exploration regime.


< Back

All content copyrighted by Avaye.com