||Reinforcement Learning and
Edited by Alborz Geramifard
The ambition of this
page is to provide a paper bank for RL related papers together with
people's comments and references..
Lagoudakis and R. Parr. "Model-Free Least-Squares policy iteration",
Machine Learning 4(2003) 1107-1149, 2003
- Boyan, J. A. "Technical
Update: Least-Squares Temporal Difference Learning." Machine
Learning 49:233-246, 2002.
- Xin X, He H. and Hu D. "Efficient
Reinforcement Learning Using Recursive Least-Squares Methods", Journal
of Artificial Intelligence Research, Vol.16,2002, pp:259-292
- R. Schoknecht and A. Merke. Convergent
combinations of reinforcement learning with function approximation. In
Advances in Neural Information Processing Systems, volume 15, 2003.
- R. Schoknecht and A. Merke. TD(0) converges provably
faster than the residual gradient algorithm. In ICML, pp 680–687, 2003.
Feature Discovery and Feature