Most RL work assumes the action space is discrete; what about continuous actions?
What about continuous time?
What is the curse of dimensionality?
Isn't tile coding just grids, and thus subject to the curse of dimensionality?
Why do you call it "tile coding" instead of "CMACs"?
Are RL methods stable with function approximation?
All content copyrighted by Avaye.com